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STREPTOCOCCAL HEAT SHOCK PROTEINS OF THE HSP6Q FAMIL Y 

TECHNICAL FIELD OF THE INVENTION 

This invention relates to Streptococcal Hsp60 proteins, including 
5 fragments thereof, and nucleic acid molecules encoding such proteins and fragments, in 
particular from Streptococcus pneumoniae and Streptococcus pyogenes, and uses of 
such proteins and nucleic acid molecules. 

"R ACKTrROI JND OF T HE INVENTION 
10 The World Health Organization has estimated that, worldwide, about 

30% of deaths of children under age 5, or about 4-5 million, result from acute 
respiratory infections. David Klein, Pneumococcal Conjugate Vaccines: Review and 
Update, in Microbial Drug Resistance 7:49, 1995. The most frequent causative agent 
responsible for these deaths is Streptococcus pneumoniae, which is also referred to as 
15 pneumococcus and causes a wide variety of infections such as sinusitis, otitis, 
pneumonia, bacteremia and meningitis. This organism is found on respiratory mucosal 
membranes of 15-35% of healthy children and up to 80% of children with respiratory 
infections. Gray et al., J. Infect Dis. 142:923, 1980; Hendley et al., J. Infect. Dis. 
732:55, 1975. In addition, Streptococcus pneumoniae is responsible for 70,000 
20 meningitis deaths and a similar number of deaths from sepsis and other infections. 

In developing countries, Pneumococcal infections are responsible for 
approximately 1.2 million deaths among children 5 years of age and younger, which 
corresponds to nearly 40% of all pneumonia related deaths. World Health 
Organization, Pneumococcal Conjugate Vaccines, reported in Report of Meeting of 
25 November 15-17, 1993 (WHO/ARI/94.34). In the industrialized world, taking the U.S. 
as an example, pneumococcus is a leading cause of severe morbidity in the general 
population and of death in the elderly as well as the immunocompromised population. 
Klein et al. (supra). Pneumococcus causes more deaths (about 50,000) in older adults 
than any other infectious agent. High risk individuals include those with sickle cell 
30 anemia, nephrotic syndrome, asplenia, alcoholism and HIV infection. Pneumococcus 
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also poses a large risk to children under the age of two. In infants below the age of two, 
pneumococcus is the predominant cause of meningitis, bacteremia and otitis media. 
Within the first two years of life, about 25% of children experience otitis media caused 
by pneumococcus, a percentage that increases to 75% by the age of six. In Finland, 
5 two-year old children have experienced, on average, more than one episode of otitis 
media. About half of the cases of acute otitis media were determined to be caused by 
pneumococcus. Eskola, J. and Kaeyhty, H., Ann. Med. 27:53, 1995. 

Pneumococcus is a gram-positive organism that has type-specific 
capsular polysaccharides. Eighty-three different type specificities have been identified 
10 and have been designated 1-83 in the American system. Jennings, Current Topics in 
Microbiology and Immunology 750:97-121, 1990. The structures of the different 
Pneumococcal polysaccharides have been reviewed by Kenne and Lindberg in The 
Polysaccharides 2:282-363 (Aspinal ed., 1983). 

The need for an effective way to generate an immune response against 
15 Streptococcus pneumoniae was recognized long ago. In 1945 it was demonstrated that 
isolated capsular polysaccharides were able to provide type-specific protection in 
humans. MacLeod et al., J. Exp. Med. 52:445-65, 1945. However, this protection was 
inadequate due to the large number of different polysaccharides needed for complete 
protection. The interest in a vaccine soon subsided due to the success of antibiotic 
20 treatment of infections. 

Recently the interest in the development of an effective vaccine has 
renewed. One reason was that antibiotic treatment of infectious diseases caused by 
encapsulated bacteria such as pneumococcus did not always prevent morbidity and 
mortality. For an analogous example, cured Hemophilus influenzae meningitis was a 
25 major cause of acquired mental retardation. Sell et al., Pediatrics 49:206-11, 1972. 
Another important reason for the renewed interest in vaccine development was the 
appearance and rapid spread of antibiotic-resistant strains of pneumococcus. For 
example, in two hospitals in Paris, France, the frequency of resistant isolates from 
patients increased from 1.8% in 1987 to 17% in 1990. In Barcelona, Spain, the rate of 
30 resistance increased from 4.3% in 1979 to 40% in 1990. See Lonks and Medeiros, 
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Antimicrobial Therapy 1 79:523-35, 1995. Multidrug-resistant pneumococcus have 
also appeared in many countries including 18 of the 50 states of the United States. 

A vaccine containing polysaccharide antigens for 14 of the 83 capsular 
types was developed and released in 1978. Lonks and Medeiros, supra. This vaccine, 

5 was improved in 1983 by the creation of second generation vaccine containing 23 
different polysaccharides. However, two large studies, using this vaccine, one with 
2837 patients, showed that the improved vaccine was only about 57% efficacious 
against Pneumococcal bacteremia. Butler et b1.,JAMA 270:1826, 1993. 

A drawback to polysaccharide-based vaccines is that the efficacy of 

10 these vaccines is problematic in infants under two years of age, who respond very 
poorly to these vaccines. Gotschlich et al., Antibodies in Human Diagnosis and 
Therapy 391-402 (Haber and Krause eds., 1977). An additional drawback is that 
antibodies produced by polysaccharide-based vaccines are predominantly of the IgM 
isotype, and therefore the immune response is not heightened upon secondary exposure 

15 to the antigen. 

These and other concerns about polysaccharide-based vaccines 
demonstrate that there is a need in the art for improved compositions which can be used 
to generate an immunogenic response directed to Streptococcus pneumoniae. 

Turning to Streptococcus pyogenes, also referred to as group A 
20 streptococcus ("GAS"), it too is a gram-positive bacterium that is causatively associated 
with a number of human disease states, ranging from acute pharyngitis (strep throat) to 
invasive diseases involving degeneration of the heart valves (acute rheumatic fever) and 
acute post-Streptococcal glomerulonephritis. Facklam, Development of Group A 
Streptococcal Vaccines, in Manual of Clinical Microbiology 1-22 (Lennette, Balows, 
25 Hausler and Truant eds., 1980). Infection by this bacterium can also cause impetigo (a 
supporative mucosal infection), invasive fasciitis (viz. flesh-eating disease), boils and 
skin abscesses (pyoderma), scarlet fever, sepsis, a severe toxic-shock like syndrome and 
pneumonia. 

Before the advent of antibiotic therapy, rheumatic fever was a leading 
30 cause of mortality in children and of chronic heart disease in individuals who survived 
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systemic infection. In developing countries, rheumatic fever is still an enormous 
problem. It has been estimated that in India over 6 million school age children suffer 
from rheumatic heart disease. Agarwal, Lancet 1 910-11, 1981. In the United States, 
the CDC has estimated that 25-40 million cases of Streptococcus pyogenes-mduced 
pharyngitis occur every year, costing over $2 billion for physician visits, culture work 
and antibiotic therapy. There also has been an increase in toxic-shock like syndrome 
caused by the organism. Presently, 10,000-15,000 cases of Streptococcal and 
Staphylococcal Toxic shock like infections occur annually in the United States. While 
presently GAS infections are treated with antibiotics, given what is known about other 
bacteria including pneumococcus (as detailed above), the proliferation of antibiotic- 
resistant strains is a concern. 

GAS are differentiated from other streptococci by their Group A 
carbohydrate, a cell wall moiety containing rhamnose and N-acetyl glucosamine. 
Different strains of GAS are classified, serologically, based on their M protein or on the 
T antigen. GAS can be assigned to 80-100 different M protein groups which form the 
principal basis for characterizing pathological strains. The M protein is a surface 
protein and is both a major virulence factor and a major protective antigen. Lancefield, 
J. Immunol. 59:307, 1962. Antibodies against M protein are opsonic and promote 
killing of the bacteria by phagocytes. Lancefield, supra. 

While M proteins are potentially useful in the constitution of a vaccine, 
several obstacles remain on the route to an effective vaccine. First, the M protein 
contains epitopes that cross-react with human tissue, especially the myocardium. Dale 
and Beachy, J. Exp. Med. 7(57:113, 1985. Thus, anti-M protein antibodies may cause 
disease rather than preventing it. Second, it may not be practical to produce a vaccine 
against all 80-100 different strains of GAS. Any vaccine containing only a few types of 
M protein may be only partially effective. While the first problem might be overcome 
by using M protein fragments that lack the cross-reactive epitopes as immunogens (Dale 
et al., V. Immunol. 757:2188-94, 1993), such an approach has not yet been proven, and 
the latter problem of immunizing against numerous distinct M proteins still needs to be 
overcome. Accordingly, there is a need in the art for a composition which provides 



1 



WO 99/35270 



PCT/CA98/01203 



nucleotide bases of any one of SEQ ID NO:l from nucleotides 15-1652, SEQ ID NO:3 
from nucleotides 15-1640, SEQ ID NO:5 from nucleotides 15-1649, or SEQ ID NO:7 
from nucleotides 15-1652 or a complement thereof or an isolated nucleic acid molecule 
encoding Hsp60 comprising a nucleic acid sequence that encodes a polypeptide 

5 comprising any one of SEQ ID NOS: 2, 4, 6 or 8 or a variant Hsp60 that is at least 95% 
homologous to a polypeptide according to any one of SEQ ID NOS: 2, 4, 6 or 8. 

In one embodiment, the present invention provides an isolated nucleic 
acid molecule according as described above, the molecule encoding a polypeptide that 
is able to be selectively bound by an antibody specific for a Streptococcus pneumoniae 

10 Hsp60 or a Streptococcus pyogenes Hsp60. 

In still another aspect in one aspect the present invention provides an 
isolated nucleic acid molecule encoding at least 8 amino acids of a Streptococcal Hsp60 
polypeptide selected from amino acid residues 1-545 of SEQ ID NO:2, amino acid 
residues 1-541 of SEQ ID NO:4, amino acid residues 1-544 of SEQ ID NO:6, and 

15 amino acid residues 1-545 of SEQ ED NO: 8, wherein the encoded Streptococcal Hsp60 
polypeptide is able to bind to a major histocompatibility complex. 

In still further aspects the present invention provides an isolated 
Streptococcus pneumoniae Hsp60 polypeptide and an isolated Streptococcus pyogenes 
Hsp60 polypeptide. 

20 In some embodiments, the isolated Hsp60 polypeptide comprises the 

amino acid sequence of any one of a Streptococcal Hsp60 polypeptide selected from 
amino acid residues 1-545 of SEQ ID NO:2, amino acid residues 1-541 of SEQ ID 
NO:4, amino acid residues 1-544 of SEQ ID NO:6, and amino acid residues 1-545 of 
SEQ ID NO: 8, or variants thereof, preferably wherein the polypeptide is able to be 

25 selectively bound by an antibody specific for either a Streptococcus pneumoniae Hsp60 
and/or Streptococcus pyogenes Hsp60. In further embodiments, the isolated Hsp60 
polypeptide is fused to an additional polypeptide to create a fusion protein. 

In still yet further aspects the present invention provides an isolated 
Hsp60 polypeptide comprising at least 8 amino acids selected from amino acid residues 

30 1-545 of SEQ ID NO:2, amino acid residues 1-541 of SEQ ID NO:4, amino acid 
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residues 1-544 of SEQ ID NO:6, and amino acid residues 1-545 of SEQ ID NO:8, 
wherein the Hsp60 polypeptide is capable of binding to a major histocompatibility 
complex and eliciting or enhancing an immune response to Streptococcus in a human 
being. 

5 In certain embodiments, the isolated Hsp60 polypeptide is derived from 

proteolytic cleavage or chemical synthesis, is an expression product of a transformed 
host cell containing a nucleic acid molecule encoding the Hsp60 or portion thereof In 
further certain embodiments, the isolated Hsp60 polypeptide comprises greater than 
95% homology to any one of a Streptococcal Hsp60 polypeptide selected from amino 

10 acid residues 1-545 of SEQ ID NO:2, amino acid residues 1-5410 of SEQ ID NO:4, 
amino acid residues 1-544 of SEQ ID NO:6, and amino acid residues 1-545 of SEQ ID 
NO:8, and wherein the Hsp60 polypeptide is able to be selectively bound by an 
antibody specific for either a Streptococcus pneumoniae Hsp60 or Streptococcus 
pyogenes Hsp60 or both. 

15 In still yet another aspect the present invention provides an isolated 

polypeptide wherein the polypeptide is an expression product of a transformed host cell 
containing one or more of the nucleic acid molecules described herein. 

In still yet further aspects the present invention provides vectors 
comprising one or more of the nucleic acid molecules described herein. In certain 

20 embodiments, the vector is an expression vector comprising a promoter in operative 
linkage with the isolated nucleic acid molecule encoding the Hsp60 or portion thereof, 
preferably further comprising a selectable or identifiable marker and/or wherein the 
promoter is a constitutive or an inducible promoter. The present invention also 
provides host cells containing such vectors. In certain embodiments, the host cell is 

25 selected from the group consisting of a bacterial cell, a mammalian cell, a yeast cell and 
an insect cell. 

In still yet other aspects the present invention provides compositions 
comprising an Hsp60 polypeptide as described herein in combination with a 
pharmaceutically acceptable carrier or diluent. In certain embodiments, the 
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composition is suitable for systemic administration, oral administration or parenteral 
administration. 

In yet other aspects the present invention provides methods for eliciting 
or enhancing an immune response in a mammal against Streptococcus, comprising 
5 administering to the mammal an effective amount of an Hsp60 polypeptide as described 
herein in combination with a pharmaceutically acceptable carrier or diluent, methods for 
eliciting or enhancing an immune response in a mammal against a target antigen 
comprising administering to the mammal the target antigen joined to an Hsp60 
polypeptide as described herein in combination with a pharmaceutically acceptable 
1 0 carrier or diluent. 

In another aspect the present invention provides compositions 
comprising an isolated nucleic acid molecule as described herein wherein the isolated 
nucleic acid molecule encodes a polypeptide having at least one amino acid difference 
from a corresponding polypeptide of an Hsp60 protein from an organism other than 
1 5 Streptococcus, 

These and other aspects of the present invention will become evident 
upon reference to the present specification and the attached drawings. In addition, 
various references are set forth herein that describe in more detail certain procedures or 
compositions (e.g., plasmids, etc.); all such references are incorporated herein by 
20 reference in their entirety. 

BRIEF DESCR IPTION OF THE FIGURES 

Figure 1 depicts the nucleotide and amino acid sequences of 
Streptococcus pneumoniae Hsp60-1 gene (SEQ ID NOS: 1 and 2 respectively). 
25 Figure 2 depicts the nucleotide and amino acid sequences of 

Streptococcus pneumoniae Hsp60-2 gene (SEQ ID NOS: 3 and 4 respectively). 

Figure 3 depicts the nucleotide and amino acid sequences of 
Streptococcus pyogenes Hsp60-1 gene (SEQ ID NOS: 5 and 6 respectively). 

Figure 4 depicts the nucleotide and amino acid sequences of 
30 Streptococcus pyogenes Hsp60-2 gene (SEQ ID NOS: 7 and 8 respectively). 
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Figure 5 is a schematic representation of the sequencing strategy used to 
deduce the sequences of the Hsp60 genes from S. pneumoniae and S. pyogenes. 

Figures 6-9 depict maps of expression vectors pETP60-l, pETP60-2, 
pETY60-l, and pETY60-2, which vectors include the Hsp60 genes from S. pneumoniae 
5 or S. pyogenes, respectively. 

Figure 10 depicts a comparison of the S. pneumoniae (SEQ ID NOS: 2 
and 4) and S. pyogenes (SEQ ID NOS: 6 and 8) Hsp60 genes with similar genes from 
other organisms (SEQ ID NOS: 9 through 34). 

Figure 1 1 depicts RP-HPLC chromatograms of the Hsp60 genes from S. 
1 0 pneumoniae and S. pyogenes, 

DETAILED DE SCRIPTION OF INVENTION 

The present invention provides methods and compositions comprising 
isolated nucleic acid molecules and polypeptides specific to Streptococcus pneumoniae 

15 and Streptococcus pyogenes, as well as vector constructs, antibodies and other materials 
related to isolated nucleic acid molecules and polypeptides. Such compositions and 
methods are useful for the diagnosis of Streptococcal infection and for generating an 
immune response to Streptococcal bacteria. 

A "stress gene," also known as "heat shock gene," is a gene that is 

20 activated or otherwise detectably upregulated due to the contact or exposure of an 
organism (containing the gene) to a stressor, such as heat shock or glucose deprivation 
or glucose addition. A given "stress gene" also includes homologous genes within 
known stress gene families, such as certain genes within the Hsp60, Hsp70 and Hsp90 
stress gene families, even though such homologous genes are not themselves induced 

25 by a stressor. As defined herein, a "stress protein," also known as a "heat shock 
protein," ("Hsp") is a protein that is encoded by a stress gene, and is therefore typically 
produced in significantly greater amounts upon the contact or exposure to the stressor of 
the organism. Each of the terms stress gene and stress protein as used in the present 
specification are inclusive of the other, unless the context indicates otherwise. 

30 Streptococcal Hsps, as well as Hsps from other organisms, appear to participate in 
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important cellular processes such as protein synthesis and assembly and disassembly of 
protein complexes. 

A variety of stress genes and proteins are well known in the art and 
include, for example, HsplOO-200, HsplOO, Hsp90, Lon, Hsp70, Hsp60, TF55, Hsp40, 
FKBPs, cyclophilins, Hsp20-30, ClpP, GrpE, Hsp 10, ubiquitin, calnexin, peptidyl- 
prolyl cis-trans isomerases, and protein disulfide isomerases. Macario, A.J.L., Int. 
J. Clin. Lab. Res. 25:59-70, 1995; Parsell, D.A., & Lindquist, S., Ann. Rev. Genet. 27: 
437-496 (1993); U.S. Patent No. 5,232,833 (Sanders et al.). 

In bacteria, the predominant stress proteins are proteins with molecular 
sizes of about 60 and 70 kDa (i.e., Hsp60 and Hsp70, respectively). Hsp70 and Hsp60 
typically represent about 1-3% of bacterial cell protein based on the staining pattern 
using sodium dodecyl sulfate-polyacrylamide gel electrophoresis ("SDS-PAGE") and 
the stain coomassie blue, but accumulate to levels as high as 25% under stressful 
conditions. Thus, Hsps are produced in an invading bacterium due to stresses put on the 
bacterium by the environment of the animal, and the Hsps become some of the most 
significant bacterial antigens displayed to the host and to which the host mounts an 
immune response. Therefore, by administering a Streptococcal Hsp to an animal, the 
Streptococcal Hsp can induce an immune response in the animal to Streptococcus, 
preferably providing resistance to such a bacterial infection. Accordingly, the isolation 
of Streptococcal Hsp60 genes provides a platform for the generation of isolated 
polypeptides or fragments or variants of Streptococcal Hsp60 useful in diagnosis and 
inhibition of Streptococcal associated disorders. 

As used herein, "polypeptide" refers to full length proteins and 
fragments thereof. 

As used herein, "peptide" refers to a fragment of the whole protein, 
whether chemically or biologically produced. 

As used herein, "immunogenic" refers to an antigen or composition that 
elicits an immune response. 

An "isolated nucleic acid molecule" refers to a polynucleotide molecule, 
in the form of a separate fragment or as a component of a larger nucleic acid construct, 
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that has been separated from its source cell (including the chromosome it normally 
resides in) at least once in a substantially pure form. Nucleic acid molecules can be 
comprised of a wide variety of nucleotides and molecules well known in the art, 
including DNA, RNA, nucleic acid analogues, or any combination of these. 

5 As used herein, "vector" refers to a polynucleotide assembly capable of 

directing expression and/or replication of the nucleic acid sequence of interest. Such 
assembly can, if desired, be included as a part of other components, such as a protein, 
lipid or lipoprotein coat, for delivery of the vector or for other purposes. 

An "expression vector" refers to polynucleotide vector having at least a 

10 promoter sequence operably linked to the nucleic acid sequence of interest. 

As used herein, a "promoter" refers to a nucleotide sequence that 
contains elements that direct the transcription of an operably linked nucleic acid 
sequence. At minimum, a promoter contains an RNA polymerase binding site. 
Promoter regions can also contain enhancer elements which by definition enhance 

1 5 transcription. 

A. Hsp60 Genes And Polypeptides From Streptococcus 

PNEUMONIAE AND STREPTOCOCCUS PYOGENES 

As used herein, Hsp60 refers to heat shock genes from the Hsp60 family 
20 of genes that produce heat shock proteins of approximately 60kDa; the nucleotide and 
amino acid sequences of Hsp60 genes and gene products from Streptococcus 
pneumoniae and Streptococcus pyogenes are set forth in Figures 1-4 (SEQ ID NOS:l-8; 
such sequences also include the PCR primers used to isolate the Hsp60 genes). Within 
the context of this invention it should be understood that Hsp60 includes wild- 
25 type/native protein sequences, as well as other variants (including alleles) and fragments 
of the native protein sequence. Briefly, such variants may result from natural 
polymorphisms or be synthesized by recombinant methodology or chemical synthesis, 
and differ from wild-type proteins by one or more amino acid substitutions, insertions, 
deletions, or the like. Further, in the region of homology to the native sequence, 
30 variants should preferably have at least 95% amino acid sequence homology, and within 
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certain embodiments, greater than 97% or 98% homology. As will be appreciated by 
those of ordinary skill in the art, a nucleotide sequence encoding Hsp60 or variant may 
differ from the native sequences presented herein due to codon degeneracies, nucleotide 
polymorphisms, or nucleotide substitutions, deletions or insertions. 
5 An "isolated nucleic acid molecule encoding Streptococcus Hsp60" 

refers to nucleic acid sequences that are capable of encoding Hsp60 polypeptides of 
Streptococcus, preferably Streptococcus pneumoniae or Streptococcus pyogenes. While 
several embodiments of such molecules are depicted in SEQ ID NOS:l-4, it should be 
understood that within the context of the present invention, reference to one or more of 

10 these genes includes variants of the genes, that is, naturally occurring variants or 
sequences that are substantially similar to the genes (and, where appropriate, the protein 
(including peptides and polypeptides) that are encoded by the genes and their variants). 
As used herein, the nucleotide sequence is deemed to be "substantially similar" if: (a) 
the nucleotide sequence is derived from the coding region of a native gene of 

15 Streptococcus and maintains substantially the same biological activity (including, for 
example, portions of the sequence or allelic variations of the sequences discussed 
above); or (b) the nucleotide sequence is capable of hybridization to the nucleotide 
sequences of the present invention under high stringency (i.e., capable of selectively 
hybridizing to nucleotide sequences from Streptococcus); or (c) the nucleotide 

20 sequences are degenerate (i.e., sequences which code for the same amino acid using a 
different codon sequence) as a result of the genetic code to the nucleotide sequences 
defined in (a) or (b); or (d) is a complement of any of the sequences described in (a), (b) 
or (c) 

One aspect of the present invention is the use of Streptococcus Hsp60 
25 nucleotide sequences to produce recombinant proteins for immunizing an animal. 
Therefore, the use of any length of nucleic acid disclosed by the present invention 
(preferably 24 nucleotides or longer) which encodes a polypeptide or fragment thereof 
that is capable of binding to the major histocompatibility complex and eliciting or 
enhancing an immunogenic response is contemplated by this invention. Immunogenic 
30 response can be readily tested by known methods such as challenging a mouse or rabbit 
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with the antigen of interest and thereafter collecting plasma and determining if the 
antibody of interest is present. Other assays particularly useful for the detection of T- 
cell responses include proliferation assays, T-cell cytotoxicity assays and assays for 
delayed hypersensitivity. In determining whether an antibody specific for the antigen of 
5 interest was produced by the animal, many diagnostic tools are available, for example, 
testing binding of labeled antigen to plasma derived antibodies, or using Enzyme- linked 
immunoassays with tag attached to the antigen of interest. 

The Streptococcal Hsp60 genes of this invention can be obtained using a 
variety of methods. For example, a nucleic acid molecule can be obtained from a 

10 cDNA or genomic expression library by screening with an antibody or antibodies 
reactive to one or more of these Hsp60s (see, e.g., Sambrook et al., Molecular Cloning: 
A Laboratory Manual, Cold Spring Harbor, 1989; Ausubel et al., Current Protocols in 
Molecular Biology, Greene Publishing, 1987). Further, random-primed PCR can be 
employed (see, e.g., Methods in EnzymoL 254:215, 1995). In addition, variations of 

1 5 random-primed PCR can also be used, especially when a particular gene or gene family 
is desired. In one such method, one of the primers is a poly deoxy-thymine and the 
other is a degenerate primer based on the amino acid sequence or nucleotide sequence 
of related Hsps. 

Other methods can also be used to obtain a nucleic acid molecule that 
20 encodes Streptococcal Hsp60. For example, a nucleic acid molecule can be obtained by 
using the sequence information provided herein to synthesize a probe which can be 
labeled, such as with a radioactive label, enzymatic label, protein label, fluorescent 
label, or the like, and hybridized to a genomic library or a cDNA library constructed in 
a phage, plasmid, phagemid, or other viral vector (see, e.g., Sambrook et al. (supra); 
25 Ausubel et al. (supra)). DNA representing RNA or genomic nucleic acid sequence can 
also be obtained by amplification using sets of primers complementary to 5' and 3' 
sequences of the cDNA sequence, such as presented in Example 1. For ease of cloning, 
restriction sites can also be incorporated into the primers. 

Variants (including alleles) of the Hsp60 genes provided herein can be 
30 readily isolated from natural variants (e.g., polymorphisms, mutants), synthesized or 
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constructed. Many methods have been developed for generating mutants (see generally 
Sambrook et al. (supra); Ausubel et al. (supra)). Briefly, preferred methods for 
generating nucleotide substitutions utilize an oligonucleotide that spans the base or 
bases to be mutated and contains the mutated base or bases. The oligonucleotide is 
5 hybridized to complementary single stranded nucleic acid and second strand synthesis is 
primed from the oligonucleotide. The double-stranded nucleic acid is prepared for 
transformation into host cells, such as E. colU other prokaryotes, yeast or other 
eukaryotes. Standard screening and vector growth protocols are used to identify mutant 
sequences and obtain high yields. 

10 Similarly, deletions and/or insertions of the Hsp60 gene can be 

constructed by any of a variety of known methods. For example, the gene can be 
digested with restriction enzymes and religated such that sequence is deleted or 
religated with additional sequence such that an insertion or large substitution is made. 
Other means of generating variant sequences, known in the art, can be employed, for 

15 examples see Sambrook et al. (supra) and Ausubel et al. (supra). Moreover, 
verification of variant sequences is typically accomplished by restriction enzyme 
mapping, sequence analysis or hybridization. Variants which encode a polypeptide that 
elicits an immunogenic response specific for Streptococcus are useful in the context of 
this invention. 

20 As noted above, the present invention also provides isolated 

polypeptides. Within the context of the present invention, unless otherwise clear from 
the context, such polypeptides are understood to include the whole, or 
portions/fragments, of a gene product derived from one or more of the Streptococcal 
Hsp60 genes or derivatives thereof as discussed above. In one aspect of the present 

25 invention, the protein is encoded by a portion of a native gene or is encoded by a 
derivative of a native gene and the protein or fragment thereof elicits or enhances an 
immune response specific for Streptococcus. 

A "purified" Hsp60 stress protein of the present invention is a heat shock 
protein of the Hsp60 family from Streptococcus pneumoniae or Streptococcus pyogenes 

30 that has been purified from its producing cell. For example, the Streptococcal Hsp60 
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polypeptides of the present invention can be purified by a variety of standard methods 
with or without a detergent purification step. For example, Streptococcal Hsp60 can be 
isolated by, among other methods, culturing suitable host and vector systems to produce 
recombinant Hsp60 (discussed further herein). Then, supernatants from such cell lines, 

5 or Hsp60 inclusions, or whole cells where the Hsp60 is not excreted into the 
supernatant, can be treated by a variety of purification procedures. For example, the 
Streptococcal Hsp60-containing composition can be applied to a suitable purification 
matrix such as an anti-Hsp60 antibody bound to a suitable support. Alternatively, anion 
or cation exchange resins, gel filtration or affinity, hydrophobic or reverse phase 

10 chromatography may be employed in order to purify the protein. The Hsp60 
polypeptide can also be concentrated using commercially available protein 
concentration filters, such as an Amicon or Millipore Pellicon ultrafiltration unit, or by 
vacuum dialysis. In another alternative the supernatant can first be concentrated using 
one of the above mentioned protein concentration filters, followed by application of the 

1 5 concentrate to a suitable purification matrix such as those described above. 

In one embodiment, the isolated Streptococcal Hsp60s of the present 
invention are produced in a recombinant form, utilizing genetic manipulation 
techniques that are well known in the art. For example, Streptococcal Hsp60 can be 
expressed as a histidine-tagged molecule, permitting purification on a nickel-chelating 

20 matrix. Alternatively, other tags may be used, including FLAG and GST. The 
associated tag can then be removed in the last step of purification, for example, for 
certain vectors, His-tagged proteins may be incubated with thrombin, resulting in 
cleavage of a recognition sequence between the tag and the Hsp60 polypeptide {e.g., 
pET vectors from Invitrogen). Following purification of Streptococcal Hsp60 from a 

25 gram-negative bacterial host, whether tagged or not, it will be necessary to reduce the 
level of endotoxin in the Hsp60 preparation, as discussed above. 

B. Vectors, Host Cells, and Expression Of Streptococcal Hsp60 
It is well known in the art that certain vectors (e.g., pUC) can be used for 
30 producing multiple copies of a nucleotide molecule of interest as well as being useful 
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for genetic manipulation techniques (e.g., site-directed mutagenesis). See Sambrook 
(supra). Of particular interest to this disclosure are expression vectors. The expression 
vector includes transcriptional promoter/enhancer elements operably linked to the 
Streptococcal Hsp60 nucleic acid molecule. The expression vector may be composed of 

5 either deoxyribonucleic acids ("DNA"), ribonucleic acids ("RNA"), or a combination of 
the two (e.g., a DNA-RNA chimera). Optionally, the expression vector may include a 
polyadenylation sequence or one or more restriction sites. Additionally, depending on 
the host cell chosen and the expression vector employed, other genetic elements such as 
an origin of replication, additional nucleic acid restriction sites, enhancers, sequences 

10 conferring inducibility of transcription, and genes encoding proteins suitable for use as 
selectable or identifiable markers, may also be incorporated into the expression vectors 
described herein. 

The manipulation and expression of Streptococcal Hsp60 genes can be 
accomplished by culturing host cells containing an expression vector capable of 

15 expressing the Hsp60 genes. Such vectors or vector constructs include either synthetic 
or cDNA-derived nucleic acid molecules or genomic DNA fragments encoding 
Streptococcal Hsp60 polypeptides, which are operably linked to suitable transcriptional 
or translational regulatory elements. Suitable regulatory elements within the expression 
vector can be derived from a variety of sources, including bacterial, fungal, viral, 

20 mammalian, insect, or plant genes. Selection of appropriate regulatory elements is 
dependent on the host cell chosen, and can be readily accomplished by one of ordinary 
skill in the art in light of the present specification. Examples of regulatory elements 
include a transcriptional promoter and enhancer or RNA polymerase binding sequence, 
a transcriptional terminator, and a ribosomal binding sequence, including a translation 

25 initiation signal. 

Nucleic acid molecules that encode any of the Streptococcal Hsp60 
polypeptides described above can be expressed by a wide variety of prokaryotic and 
eukaryotic host cells, including bacterial, mammalian, yeast or other fungi, viral, insect, 
and plant cells. The selection of a host cell may also assist the production of 

30 glycosolated or non-glycosolated Hsp60s, depending upon the desires of the user. 
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Methods for transforming or transfecting such cells to express nucleic acids are well 
known in the art (see, e.g., Itakura et al., U.S. Patent No. 4,704,362; Hinnen et al., 
PNAS USA 75:1929-1933, 1978; Murray et al., U.S. Patent No. 4,801,542; Upshall et 
al., U.S. Patent No. 4,935,349; Hagen et al., U.S. Patent No. 4,784,950; Axel et al., U.S. 
5 Patent No. 4,399,216; Goeddel et al, U.S. Patent No. 4,766,075; and Sambrook et al., 
Molecular Cloning: A Laboratory Manual, 2nd edition, Cold Spring Harbor 
Laboratory Press, 1989; for plant cells see Czako and Marton, Plant Physiol 104:1067- 
1071, 1994; Paszkowski et aL, Biotech. 24:387-392, 1992). 

Bacterial host cells suitable for carrying out the present invention include 
10 E. coli, such as E. coli DH5a (Stratagene, La Jolla, California), M. leprae, M 
tuberculosis, M. bovis, B. subtilis, Salmonella typhimurium, and various species within 
the genera Pseudomonas, Streptomyces, Streptococcus, and Staphylococcus, as well as 
many other bacterial species well known to one of ordinary skill in the art. 

Bacterial expression vectors preferably comprise a promoter, which 
15 functions in the host cell, one or more selectable phenotypic markers, and a bacterial 
origin of replication. Representative promoters include the P-lactamase (penicillinase) 
and lactose promoter system (see Chang et al., Nature 275:615, 1978), the T7 RNA 
polymerase promoter (Studier et aL, Meth. Enzymol. 755:60-89, 1990), the lambda 
promoter (Elvin et al., Gene 57:123-126, 1990), the trp promoter (Nichols and 
20 Yanofsky, Meth. in Enzymology 101:155, 1983) and the tac promoter (Russell et aL, 
Gene 20: 231, 1982). Representative selectable markers include various antibiotic 
resistance markers such as the kanamycin or ampicillin resistance genes. Many 
plasmids suitable for transforming host cells are well known in the art, including among 
others, pBR322 (see Bolivar et aL, Gene 2:95, 1977), the pUC plasmids pUC18, 
25 pUC19, pUCl 18, pUCl 19 (see Messing, Meth. in Enzymology 707:20-77, 1983; Vieira 
and Messing, Gene 79:259-268, 1982), and pNH8A, pNH16a, pNH18a, and Bluescript 
Ml 3 (Stratagene, La Jolla, Calif.). 

Fungal host cells suitable for carrying out the present invention include, 
among others, Saccharomyces pombe, Saccharomyces cerevisiae, the genera Pichia or 
30 Kluyveromyces and various species of the genus Aspergillus (McKnight et aL, U.S. 
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Patent No. 4,935,349). Suitable expression vectors for yeast and fungi include, among 
others, YCp50 (ATCC No. 37419) for yeast, and the amdS cloning vector pV3 
(Turnbull, Bio/Technology 7:169, 1989), YRp7 (Struhl et al., Proc. Natl. Acad. Sci. 
USA 75:1035-1039, 1978), YEpl3 (Broach et al., Gene 5:121-133, 1979), pJDB249 and 

5 pJDB219 (Beggs, Nature 275:104-108, 1978) and derivatives thereof. 

Preferred promoters for use in yeast include promoters from yeast 
glycolytic genes (Hitzeman et al., J. Biol. Chem. 255:12073-12080, 1980; Alber and 
Kawasaki, J. Mol. Appl Genet. 7:419-434, 1982) or alcohol dehydrogenase genes 
(Young et al., in Genetic Engineering of Microorganisms for Chemicals, Hollaender et 

10 al. (eds.), p. 355, Plenum, New York, 1982; Ammerer, Meth. EnzymoL 707:192-201, 

1983) . Examples of useful promoters for fungi vectors include those derived from 
Aspergillus nidulans glycolytic genes, such as the adh3 promoter (McKnight et al., 
EMBO J. 4:2093-2099, 1985). The expression units may also include a transcriptional 
terminator. An example of a suitable terminator is the adh3 terminator (McKnight et 

15 al., ibid., 1985). 

As with bacterial vectors, the yeast vectors will generally include a 

selectable marker, which may be one of any number of genes that exhibit a dominant 

phenotype for which a phenotypic assay exists to enable transformants to be selected. 

Preferred selectable markers include those that complement host cell auxotrophy, 
20 provide antibiotic resistance or enable a cell to utilize specific carbon sources, and 

include leu2 (Broach et al., ibid.), ura3 (Botstein et al., Gene 8:17, 1979), or his3 (Struhl 

et al, ibid.). Another suitable selectable marker is the cat gene, which confers 

chloramphenicol resistance on yeast cells. 

Techniques for transforming fungi are well known in the literature, and 
25 have been described, for instance, by Beggs (ibid.), Hinnen et al. (Proc. Natl. Acad. Sci. 

USA 75:1929-1933, 1978), Yelton et al. (Proc. Natl. Acad. ScL USA #7:1740-1747, 

1984) , and Russell (Nature 507:167-169, 1983). The genotype of the host cell may 
contain a genetic defect that is complemented by the selectable marker present on the 
expression vector. Choice of a particular host and selectable marker is well within the 

30 level of ordinary skill in the art in light of the present specification. 
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Protocols for the transformation of yeast are also well known to those of 
ordinary skill in the art. For example, transformation may be readily accomplished 
either by preparation of spheroplasts of yeast with DNA (see Hinnen et al., PNAS USA 
75:1929, 1978) or by treatment with alkaline salts such as LiCl (see Itoh et al., J. 

5 Bacteriology 753:163, 1983). Transformation of fungi may also be carried out using 
polyethylene glycol as described by Cullen et al. (Bio/Technology 5:369, 1987). 

Viral vectors include those that comprise a promoter that directs the 
expression of an isolated nucleic acid molecule that encodes a Streptococcal Hsp60 as 
described above. A wide variety of promoters may be utilized within the context of the 

10 present invention, including for example, promoters such as MoMLV LTR, RSV LTR, 
Friend MuLV LTR, adenoviral promoter (Ohno et al., Science 265: 781-784, 1994), 
neomycin phosphotransferase promoter/enhancer, late parvovirus promoter (Koering et 
al., Hum. Gene Therap. 5:457-463, 1994), Herpes TK promoter, SV40 promoter, 
metallothionein Ha gene enhancer/promoter, cytomegalovirus immediate early 

15 promoter, and the cytomegalovirus immediate late promoter. The promoter may also be 
a tissue-specific promoter (see e.g., WO 91/02805; EP 0,415,731; and WO 90/07936). 
In addition to the above-noted promoters, other viral-specific promoters (e.g., retroviral 
promoters (including those noted above, as well as others such as HIV promoters), 
hepatitis, herpes (e.g., EBV), and bacterial, fungal or parasitic-specific (e.g., malarial- 

20 specific) promoters may be utilized in order to target a specific cell or tissue which is 
infected with a virus, bacteria, fungus or parasite. 

Thus, Streptococcal Hsp60 polypeptides of the present invention may be 
expressed from a variety of viral vectors, including for example, herpes viral vectors 
(e.g., U.S. Patent No. 5,288,641), adenoviral vectors (e.g., WO 94/26914, WO 93/9191; 

25 Kolls et al., PNAS 97(1):215-219, 1994; Kass-Eisler et al., PNAS 90(24): 11498-502, 
1993; Guzman et al., Circulation &S(6):2838-48, 1993; Guzman et al., Cir. Res. 
75(6): 1202-1207, 1993; Zabner et al., Cell 75(2):207-216, 1993; Li et al., Hum Gene 
Ther. 4(4):403-409, 1993; Caillaud et al., Eur. J. Neurosci. 5(10):1287-1291, 1993; 
Vincent et al., Nat Genet. 5(2):130-134, 1993; JafFe et al., Nat. Genet. 7(5):372-378, 

30 1992; and Levrero et al., Gene 1 01(2): 195-202, 1991), adenovirus-associated viral 
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vectors (Flotte et al., PNAS 90(22):1O613-1O617, 1993), baculovirus vectors, parvovirus 
vectors (Koering et al., Hum. Gene Therap. 5:457-463, 1994), pox virus vectors 
(Panicali and Paoletti, PNAS 79:4927-4931, 1982; and Ozaki et al., Biochem. Biophys. 
Res. Comm. 793(2):653-660, 1993), and retroviruses (e.g., EP 0,415,731; WO 

5 90/07936; WO 91/0285, WO 94/03622; WO 93/25698; WO 93/25234; U.S. Patent No. 
5,219,740; WO 93/1 1230; WO 93/10218. Within various embodiments, either the viral 
vector itself or a viral particle which contains the viral vector may be utilized in the 
methods and compositions described below. 

Mammalian cells suitable for carrying out the present invention include, 

10 among others: PC12 (ATCC No. CRL1721), N1E-115 neuroblastoma, SK-N-BE(2)C 
neuroblastoma, SHSY5 adrenergic neuroblastoma, NS20Y and NG108-15 murine 
cholinergic cell lines, or rat F2 dorsal root ganglion line, COS (e.g., ATCC No. CRL 
1650 or 1651), BHK (e.g., ATCC No. CRL 6281; BHK 570 cell line (deposited with 
the American Type Culture Collection under accession number CRL 10314), CHO 

15 (ATCC No. CCL 61), HeLa (e.g., ATCC No. CCL 2), 293 (ATCC No. 1 573; Graham et 
al., J. Gen. Virol. 36:59-72, 1977) and NS-1 cells. Other mammalian cell lines may be 
used within the present invention, including Rat Hep I (ATCC No. CRL 1600), Rat Hep 
II (ATCC No. CRL 1548), TCMK (ATCC No. CCL 139), Human lung (ATCC No. 
CCL 75.1), Human hepatoma (ATCC No. HTB-52), Hep G2 (ATCC No. HB 8065), 

20 Mouse liver (ATCC No. CCL 29.1), NCTC 1469 (ATCC No. CCL 9.1), SP2/0-Agl4 
(ATCC No. 1581), HIT-T15 (ATCC No. CRL 1777), and RINm 5AHT2B (Orskov and 
Nielson, FEBS 229(1):175-178, 1988). 

Mammalian expression vectors for use in carrying out the present 
invention include a promoter capable of directing the transcription of a cloned gene or 

25 cDNA. Preferred promoters include viral promoters and cellular promoters. Viral 
promoters include the cytomegalovirus immediate early promoter (Boshart et al., Cell 
41:52 1-530, 1985), cytomegalovirus immediate late promoter, SV40 promoter 
(Subramani et al., Mol. Cell. Biol. 7:854-864, 1981), MMTV LTR, RSV LTR, 
metallothionein-1, adenovirus El a. Cellular promoters include the mouse 

30 metallothionein-1 promoter (Palmiter et al., U.S. Patent No. 4,579,821), action 
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promoters, a mouse V H promoter (Bergman et al., Proc. Natl. Acad. Sci. USA #7:7041- 
7045, 1983; Grant et al., Nucl. Acids Res. 75:5496, 1987) and a mouse V H promoter 
(Loh et al., Cell 35:85-93, 1983). The choice of promoter will depend, at least in part, 
upon the level of expression desired or the recipient cell line to be transfected. 
5 Such expression vectors can also contain a set of RNA splice sites 

located downstream from the promoter and upstream from the DNA sequence encoding 
the peptide or protein of interest. Preferred RNA splice sites may be obtained from 
adenovirus and/or immunoglobulin genes. Also contained in the expression vectors is a 
polyadenylation signal located downstream of the coding sequence of interest. Suitable 

10 polyadenylation signals include the early or late polyadenylation signals from SV40 
(Kaufman and Sharp, ibid.), the polyadenylation signal from the Adenovirus 5 E1B 
region and the human growth hormone gene terminator (DeNoto et al., Nuc. Acids Res. 
9:3719-3730, 1981). The expression vectors may include a noncoding viral leader 
sequence, such as the Adenovirus 2 tripartite leader, located between the promoter and 

15 the RNA splice sites. Preferred vectors may also include enhancer sequences, such as 
the SV40 enhancer. Expression vectors may also include sequences encoding the 
adenovirus VA RNAs. Suitable expression vectors can be obtained from commercial 
sources (e.g., Stratagene, La Jolla, Calif). 

Vector constructs comprising cloned DNA sequences can be introduced 

20 into cultured mammalian cells by, for example, calcium phosphate-mediated 
transfection (Wigler et al., Cell 14:725, 1978; Corsaro and Pearson, Somatic Cell 
Genetics 7:603, 1981; Graham and Van der Eb, Virology 52:456, 1973), electroporation 
(Neumann et al., EMBO J. 7:841-845, 1982), or DEAE-dextran mediated transfection 
(Ausubel et al. (eds.), Current Protocols in Molecular Biology, John Wiley and Sons, 

25 Inc., NY, 1987). See generally Sambrook et al. (supra). To identify cells that have 
stably integrated the cloned DNA, a selectable marker is generally introduced into the 
cells along with the gene or cDNA of interest. Preferred selectable markers for use in 
cultured mammalian cells include genes that confer resistance to drugs, such as 
neomycin, hygromycin, and methotrexate. The selectable marker may be an 

30 amplifiable selectable marker. Preferred amplifiable selectable markers are the DHFR 
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gene and the neomycin resistance gene. Selectable markers are reviewed by Thilly 
{Mammalian Cell Technology, Butterworth Publishers, Stoneham, MA). 

Mammalian cells containing a suitable vector are allowed to grow for a 
period of time, typically 1-2 days, to begin expressing the DNA sequence(s) of interest. 
5 Drug selection is then applied to select for growth of cells that are expressing the 
selectable marker in a stable fashion. For cells that have been transfected with an 
amplifiable, selectable marker the drug concentration may be increased in a stepwise 
manner to select for increased copy number of the cloned sequences, thereby increasing 
expression levels. Cells expressing the introduced sequences are selected and screened 
10 for production of the protein of interest in the desired form or at the desired level. Cells 
that satisfy these criteria can then be cloned and scaled up for production. 

Numerous insect host cells known in the art can also be useful within the 
present invention, in light of the subject specification. For example, the use of 
baculoviruses as vectors for expressing heterologous DNA sequences in insect cells has 
15 been reviewed by Atkinson et al. (Pestic. ScL 25:215-224,1990). 

Numerous plant host cells known in the art can also be useful within the 
present invention, in light of the subject specification. For example, the use of 
Agrobacterium rhizogenes as vectors for expressing genes in plant cells has been 
reviewed by Sinkar et al., «/. Bioscl (Bangalore) 77:47-58, 1987. 
20 Upon expression of the Streptococcal Hsp60 polypeptides or fragments 

thereof in the host cells, the polypeptide or peptide may be preliminarily released and/or 
isolated from the host cell utilizing methods such as those discussed previously herein. 

As noted above, depending on the host cell in which one desires to 
express Hsp60, the gene encoding the protein is introduced into an expression vector 
25 comprising a promoter that is active in the host cell. Other components of the 
expression unit such as transcribed but not translated sequences at the ends of the 
coding region may also be selected according to the particular host utilized. In some 
cases, it may be necessary to introduce artificially an intervening sequence to ensure 
v high level expression. Expression can be monitored by SDS-PAGE and staining, if 
30 expression levels are sufficiently high. Additionally, if the protein is produced with a 
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tag, detection by anti-tag antibody can be carried out and if produced with no tag, 
detection by anti-Hsp60 antibody that does not recognize homologous proteins of the 
host may be employed. Further, any method known in the art for protein identification 
may be utilized to this end (e.g., a high resolution electrophoretic method or 2D 
5 electrophoresis). 

C* Preparation Of Antibodies Against the Hsp60 Polypeptides Of 
The Present Invention 

In another aspect, the proteins of the present invention are utilized to 

10 prepare specifically binding antibodies (i.e., binding partners). Accordingly, the present 
invention also provides such antibodies. Within the context of the present invention, 
the term "antibodies" includes polyclonal antibodies, monoclonal antibodies, anti- 
idiotype antibodies, fragments thereof such as F(ab')2 and Fab fragments, and 
recombinantly or synthetically produced binding partners. Such binding partners 

15 incorporate the variable regions that permit a monoclonal antibody to specifically bind, 
which means an antibody able to selectively bind to a peptide produced from one of the 
Streptococcal Hsp60 genes of the invention. The affinity of a monoclonal antibody or 
binding partner can be readily determined by one of ordinary skill in the art (see 
Scatchard, Ann. N. Y. Acad. Set 57:660-672, 1949). 

20 Polyclonal antibodies can be readily generated by one of ordinary skill in 

the art from a variety of warm-blooded animals such as horses, cows, goats, sheep, 
dogs, chickens, turkeys, rabbits, mice, or rats. Briefly, the desired protein or peptide is 
utilized to immunize the animal through intraperitoneal, intramuscular, intraocular, or 
subcutaneous injections. The immunogenicity of the protein or peptide of interest may 

25 be increased through the use of an adjuvant such as Freund's complete or incomplete 
adjuvant. Following several booster immunizations, small samples of serum are 
collected and tested for reactivity to the desired protein or peptide. 

Particularly preferred polyclonal antisera give a signal that is at least 
three times greater than background. Once the titer of the animal has reached a plateau 
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in terms of its reactivity to the protein, larger quantities of polyclonal antisera may be 
readily obtained either by weekly bleedings, or by exsanguinating the animal. 

Monoclonal antibodies can also be readily generated using well-known 
techniques (see U.S. Patent Nos. RE 32,011, 4,902,614, 4,543,439, and 4,411,993; see 
5 also Monoclonal Antibodies, Hybridomas: A New Dimension in Biological Analyses, 
Plenum Press, Kennett, McKeam, and Bechtol (eds.), 1980, and Antibodies: A 
Laboratory Manual, Harlow and Lane (eds.), Cold Spring Harbor Laboratory Press, 
1988). Briefly, in one embodiment, a subject animal such as a rat or mouse is injected 
with a desired protein or peptide. If desired, various techniques may be utilized in order 

10 to increase the resultant immune response generated by the protein, in order to develop 
greater antibody reactivity. For example, the desired protein or peptide may be coupled 
to another protein such as ovalbumin or keyhole limpet hemocyanin (KLH), or through 
the use of adjuvants such as Freund's complete or incomplete adjuvant. The initial 
elicitation of an immune response, may preferably be through intraperitoneal, 

1 5 intramuscular, intraocular, or subcutaneous routes. 

Between one and three weeks after the initial immunization, the animal 
may be reimmunized. The animal may then be test bled and the serum tested for 
binding to the desired antigen using assays as described above. Additional 
immunizations may also be accomplished until the animal has reached a plateau in its 

20 reactivity to the desired protein or peptide. The animal may then be given a final boost 
of the desired protein or peptide, and three to four days later sacrificed. At this time, the 
spleen and lymph nodes may be harvested and disrupted into a single cell suspension by 
passing the organs through a mesh screen or by rupturing the spleen or lymph node 
membranes which encapsulate the cells. Within one embodiment the red cells are 

25 subsequently lysed by the addition of a hypotonic solution, followed by immediate 
return to isotonicity. 

Within another embodiment, suitable cells for preparing monoclonal 
antibodies are obtained through the use of in vitro immunization techniques. Briefly, an 
animal is sacrificed, and the spleen and lymph node cells are removed as described 

30 above. A single cell suspension is prepared, and the cells are placed into a culture 
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containing a form of the protein or peptide of interest that is suitable for generating an 
immune response as described above. Subsequently, the lymphocytes are harvested and 
fused as described below. 

Cells that are obtained through the use of in vitro immunization or from 

5 an immunized animal as described above may be immortalized by transfection with a 
virus such as the Epstein-Barr Virus (EBV). (See Glasky and Reading, Hybridoma 
S(4):377-389, 1989.) Alternatively, within a preferred embodiment, the harvested 
spleen and/or lymph node cell suspensions are fused with a suitable myeloma cell in 
order to create a "hybridoma" which secretes monoclonal antibodies. Suitable myeloma 

10 lines are preferably defective in the construction or expression of antibodies, and are 
additionally syngeneic with the cells from the immunized animal. Many such myeloma 
cell lines are well known in the art and may be obtained from sources such as the 
American Type Culture Collection (ATCC), Rockville, Maryland (see Catalogue of 
Cell Lines & Hybridomas, 6th ed., ATCC, 1988). Representative myeloma lines 

15 include: for humans, UC 729-6 (ATCC No. CRL 8061), MC/CAR-Z2 (ATCC 
No. CRL 8147), and SKO-007 (ATCC No. CRL 8033); for mice, SP2/0-Agl4 (ATCC 
No. CRL 1581), and P3X63Ag8 (ATCC No. TIB 9); and for rats, Y3-Agl.2.3 (ATCC 
No. CRL 1631), and YB2/0 (ATCC No. CRL 1662). Particularly preferred fusion lines 
include NS-1 (ATCC No. TIB 18) and P3X63 - Ag 8.653 (ATCC No. CRL 1580), 

20 which may be utilized for fusions with either mouse, rat, or human cell lines. Fusion 
between the myeloma cell line and the cells from the immunized animal can be 
accomplished by a variety of methods, including the use of polyethylene glycol (PEG) 
(see Antibodies: A Laboratory Manual, Harlow and Lane, supra) or electrofusion. (See 
Zimmerman and Vienken, J. Membrane Biol 67:165-182, 1982.) 

25 Following the fusion, the cells are placed into culture plates containing a 

suitable medium, such as RPMI 1640 or DMEM (Dulbecco's Modified Eagles Medium, 
JRH Biosciences, Lenexa, Kan.). The medium may also contain additional ingredients, 
such as Fetal Bovine Serum (FBS, e.g., from Hyclone, Logan, Utah, or JRH 
Biosciences), thymocytes that were harvested from a baby animal of the same species as 

30 was used for immunization, or agar to solidify the medium. Additionally, the medium 
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should contain a reagent which selectively allows for the growth of fused spleen and 
myeloma cells. Particularly preferred is the use of HAT medium (hypoxanthine, 
aminopterin, and thymidine) (Sigma Chemical Co., St. Louis, Mo.). After about seven 
days, the resulting fused cells or hybridomas may be screened in order to determine the 

5 presence of antibodies which recognize the desired antigen. Following several clonal 
dilutions and reassays, hybridoma producing antibodies that bind to the protein of 
interest can be isolated. 

Other techniques may also be utilized to construct monoclonal 
antibodies. (See Huse et al., "Generation of a Large Combinational Library of the 

10 Immunoglobulin Repertoire in Phage Lambda," Science 246:1275-1281, 1989; see also 
Sastry et al., "Cloning of the Immunological Repertoire in Escherichia coli for 
Generation of Monoclonal Catalytic Antibodies: Construction of a Heavy Chain 
Variable Region-Specific cDNA Library," Proc, Natl Acad. Set USA 55:5728-5732, 
1989; see also Alting-Mees etal., "Monoclonal Antibody Expression Libraries: A 

15 Rapid Alternative to Hybridomas," Strategies in Molecular Biology 5:1-9, 1990; these 
references describe a commercial system available from Stratagene, La Jolla, California, 
which enables the production of antibodies through recombinant techniques.) Briefly, 
mRNA is isolated from a B cell population and utilized to create heavy and light chain 
immunoglobulin cDNA expression libraries in the AJMMUNOZAP(H) and X 

20 IMMUNOZAP(L) vectors. These vectors may be screened individually or co-expressed 
to form Fab fragments or antibodies (see Huse et al. 
(supra); see also Sastry et al. (supra)). Positive plaques can subsequently be converted 
to a non-lytic plasmid which allows high level expression of monoclonal antibody 
fragments from E. coli. 

25 Similarly, binding partners can also be constructed utilizing recombinant 

DNA techniques to incorporate the variable regions of a gene that encodes a specifically 
binding antibody. The construction of these binding partners can be readily 
accomplished by one of ordinary skill in the art given the disclosure provided herein. 
(See Larrick etal., "Polymerase Chain Reaction Using Mixed Primers: Cloning of 

30 Human Monoclonal Antibody Variable Region Genes From Single Hybridoma Cells," 
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Biotechnology 7:934-938, 1989; Riechmann et al., "Reshaping Human Antibodies for 
Therapy" Nature 352:323-327, 1988; Roberts etaL, "Generation of an Antibody with 
Enhanced Affinity and Specificity for its Antigen by Protein Engineering," Nature 
525:731-734, 1987; Verhoeyen etaL, "Reshaping Human Antibodies: Grafting an 

5 Antilysozyme Activity" Science 239:1534-1536, 1988; Chaudhary etaL, "A 
Recombinant Immunotoxin Consisting of Two Antibody Variable Domains Fused to 
Pseudomonas Exotoxin," Nature 339:394-397, 1989; see also U.S. Patent No. 
5,132,405 entitled "Biosynthetic Antibody Binding Sites.") Briefly, in one 
embodiment, DNA segments encoding the desired protein or peptide of interest-specific 

10 antigen binding domains are amplified from hybridomas that produce a specifically 
binding monoclonal antibody, and are inserted directly into the genome of a cell that 
produces human antibodies. {See Verhoeyen et aL (supra)\ see also Reichmann et al. 
(supra)). This technique allows the antigen-binding site of a specifically binding mouse 
or rat monoclonal antibody to be transferred into a human antibody. Such antibodies 

15 are preferable for therapeutic use in humans because they are not as antigenic as rat or 
mouse antibodies. 

In an alternative embodiment, genes that encode the variable region from 
a hybridoma producing a monoclonal antibody of interest are amplified using 
oligonucleotide primers for the variable region. These primers may be synthesized by 

20 one of ordinary skill in the art, or may be purchased from commercially available 
sources. For instance, primers for mouse and human variable regions including, among 
others, primers for V Ha , V H b, V Hc , V H d, C H i, V L and C L regions, are available from 
Stratagene (La Jolla, Calif). These primers may be utilized to amplify heavy or light 
chain variable regions, which may then be inserted into vectors such as 

25 IMMUNOZAP™(H) or IMMUNOZAP™(L) (Stratagene), respectively. These vectors 
may then be introduced into E. coli for expression. Utilizing these techniques, large 
amounts of a single-chain polypeptide containing a fusion of the V H and V L domains 
may be produced (see Bird et aL, Science 242:423-426, 1988). 
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Monoclonal antibodies and other binding partners can be produced in a 
number of host systems, including tissue cultures, bacteria, eukaryotic cells, plants and 
other host systems known in the art. 

Once suitable antibodies or binding partners have been obtained, they 
5 may be isolated or purified by many techniques well known to those of ordinary skill in 
the art (see Antibodies: A Laboratory Manual, Harlow and Lane (supra)). Suitable 
techniques include peptide or protein affinity columns, HPLC or RP-HPLC, purification 
on protein A or protein G columns, or any combination of these techniques. Within the 
context of the present invention, the term "isolated" as used to define antibodies or 

10 binding partners means "substantially free of other blood components." 

The binding partners of the present invention have many uses. For 
example, antibodies can be utilized in flow cytometry to identify cells bearing such a 
protein. Briefly, in order to detect the protein or peptide of interest on cells, the cells 
are incubated with a labeled monoclonal antibody which specifically binds to the 

15 protein of interest, followed by detection of the presence of bound antibody. Labels 
suitable for use within the present invention are well known in the art including, among 
others, flourescein isothiocyanate (FITC), phycoerythrin (PE), horse radish peroxidase 
(HRP), and colloidal gold. Particularly preferred for use in flow cytometry is FITC, 
which may be conjugated to purified antibody according to the method of Keltkamp in 

20 "Conjugation of Fluorescein Isothiocyanate to Antibodies. I. Experiments on the 
Conditions of Conjugation," Immunology 7^:865-873, 1970. (See also Keltkamp, 
"Conjugation of Fluorescein Isothiocyanate to Antibodies. II. A Reproducible 
Method," Immunology 75:875-881, 1970; Goding, "Conjugation of Antibodies with 
Fluorochromes: Modification to the Standard Methods," J. Immunol. Methods 73:215- 

25 226, 1970.) The antibodies can also be used to target drugs to Streptococcus as well as 
a diagnostic for determining Streptococcal infection. 



WO 99/35270 



PCT/CA98/01203 



29 

D. Assays That Utilize The Hsp60 Polypeptides, Or Antibodies 
Thereto, Of The Present Invention 

A variety of assays can be utilized in order to detect the Hsp60 
polypeptides from Streptococcus pneumoniae and Streptococcus pyogenes of the 
present invention, or antibodies that specifically bind to such Hsp60 polypeptides. 
Exemplary assays are described in detail in Antibodies: A Laboratory Manual Harlow 
and Lane (eds.), Cold Spring Harbor Laboratory Press, 1988. Representative examples 
of such assays include: countercurrent immuno-electrophoresis (CIEP), 
radioimmunoassays, radioimmunoprecipitations, enzyme-linked immuno-sorbent 
assays (ELISA), dot blot assays, inhibition or competition assays, and sandwich assays, 
immunostick (dipstick) assays, simultaneous immunoassays, immunochromatographic 
assays, immunofiltration assays, latex bead agglutination assays, immunofluorescent 
assays, biosensor assays, and low-light detection assays {see U.S. Patent Nos. 4,376,110 
and 4,486,530; see also Antibodies: A Laboratory Manual {supra)). 

A fluorescent antibody test (FA-test) uses a fluorescently labeled 
antibody able to bind to one of the proteins of the invention. For detection, visual 
determinations are made by a technician using fluorescence microscopy, yielding a 
qualitative result. In one embodiment, this assay is used for the examination of tissue 
samples or histological sections. 

In latex bead agglutination assays, antibodies to one or more of the 
proteins of the present invention are conjugated to latex beads. The antibodies 
conjugated to the latex beads are then contacted with a sample under conditions 
permitting the antibodies to bind to desired proteins in the sample, if any. The results 
are then read visually, yielding a qualitative result. In one embodiment, this format can 
be used in the field for on-site testing. 

Enzyme immunoassays (EIA) include a number of different assays able 
to utilize the antibodies provided by the present invention. For example, a 
heterogeneous indirect EIA uses a solid phase coupled with an antibody of the invention 
and an affinity purified, anti-IgG immunoglobulin preparation. Preferably, the solid 
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phase is a polystyrene microtiter plate. The antibodies and immunoglobulin preparation 
are then contacted with the sample under conditions permitting antibody binding, which 
conditions are well known in the art. The results of such an assay can be read visually, 
but are preferably read using a spectrophotometer, such as an ELISA plate reader, to 
yield a quantitative result. An alternative solid phase EIA format includes plastic- 
coated ferrous metal beads able to be moved during the procedures of the assay by 
means of a magnet. Yet another alternative is a low-light detection immunoassay 
format. In this highly sensitive format, the light emission produced by appropriately 
labeled bound antibodies are quantitated automatically. Preferably, the reaction is 
performed using microtiter plates. 

In an alternative embodiment, a radioactive tracer is substituted for the 
enzyme mediated detection in an EIA to produce a radioimmunoassay (RIA). 

In a capture-antibody sandwich enzyme assay, the desired protein is 
bound between an antibody attached to a solid phase, preferably a polystyrene 
microtiter plate, and a labeled antibody. Preferably, the results are measured using a 
spectrophotometer, such as an ELISA plate reader. 

In a sequential assay format, reagents are allowed to incubate with the 
capture antibody in a step wise fashion. The test sample is first incubated with the 
capture antibody. Following a wash step, an incubation with the labeled antibody 
occurs. In a simultaneous assay, the two incubation periods described in the sequential 
assay are combined. This eliminates one incubation period plus a wash step. 

A dipstick/immunostick format is essentially an immunoassay except 
that the solid phase, instead of being a polystyrene microtiter plate, is a polystyrene 
paddle or dipstick. Reagents are the same and the format can either be simultaneous or 
sequential. 

In a chromatographic strip test format, a capture antibody and a labeled 
antibody are dried onto a chromatographic strip, which is typically nitrocellulose or 
nylon of high porosity bonded to cellulose acetate. The capture antibody is usually 
spray dried as a line at one end of the strip. At this end there is an absorbent material 
that is in contact with the strip. At the other end of the strip the labeled antibody is 
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deposited in a manner that prevents it from being absorbed into the membrane. Usually, 
the label attached to the antibody is a latex bead or colloidal gold. The assay may be 
initiated by applying the sample immediately in front of the labeled antibody. 

Immunofiltration/immunoconcentration formats combine a large solid 
5 phase surface with directional flow of sample/reagents, which concentrates and 
accelerates the binding of antigen to antibody. In a preferred format, the test sample is 
preincubated with a labeled antibody then applied to a solid phase such as fiber filters or 
nitrocellulose membranes or the like. The solid phase can also be precoated with latex 
or glass beads coated with capture antibody. Detection of analyte is the same as 

10 standard immunoassay. The flow of sample/reagents can be modulated by either 
vacuum or the wicking action of an underlying absorbent material. 

A threshold biosensor assay is a sensitive, instrumented assay amenable 
to screening large numbers of samples at low cost. In one embodiment, such an assay 
comprises the use of light addressable potentiometric sensors wherein the reaction 

15 involves the detection of a pH change due to binding of the desired protein by capture 
antibodies, bridging antibodies and urease-conjugated antibodies. Upon binding, a pH 
change is effected that is measurable by translation into electrical potential (javolts). 
The assay typically occurs in a very small reaction volume, and is very sensitive. 
Moreover, the reported detection limit of the assay is 1,000 molecules of urease per 

20 minute. 

The present invention also provides for probes and primers for detecting 
Streptococcus pneumoniae and Streptococcus pyogenes. 

In one embodiment of this aspect of the invention, probes are provided 
that are capable of specifically hybridizing to S. pneumoniae and S. pyogenes Hsp60 
25 genes DNA or RNA. For purposes of the present invention, probes are "capable of 
hybridizing" to S. pneumoniae and S. pyogenes Hsp60 genes DNA or RNA if they 
hybridize under conditions of high stringency (see Sambrook et al. (supra)). Preferably, 
the probe may be utilized to hybridize to suitable nucleotide sequences under highly 
stringent conditions, such as 6x SSC, lx Denhardt's solution (Sambrook et al. (supra)), 
30 0.1% SDS at 65°C and at least one wash to remove excess probe in the presence of 0.2x 
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SSC, lx Denhardt's solution, 0.1% SDS at 65°C. Except as otherwise provided herein, 
probe sequences are designed to allow hybridization to Streptococcal DNA or RNA 
sequences, but not to DNA or RNA sequences from other organisms, particularly other 
bacterial sequences. The probes are usied, for example, to hybridize to nucleic acid that 
has been exposed from a cell in a sample. The hybridized probe is then detected, 
thereby indicating the presence of the desired cellular nucleic acid. Preferably, the 
cellular nucleic acid is subjected to an amplification procedure, such as PCR, prior to 
hybridization. 

Probes of the present invention may be composed of either 
deoxyribonucleic acids (DNA) or ribonucleic acids (RNA), and may be as few as about 
12 nucleotides in length, usually about 14 to 18 nucleotides in length, and possibly as 
large as the entire sequence of the S. pneumoniae and S. pyogenes Hsp60 genes. 
Selection of probe size is somewhat dependent upon the use of the probe, and is within 
the skill of the art. 

Suitable probes can be constructed and labeled using techniques that are 
well known in the art. Shorter probes of, for example, 12 bases can be generated 
synthetically. Longer probes of about 75 bases to less than 1.5 kb are preferably 
generated by, for example, PCR amplification in the presence of labeled precursors such 
as [a- 32 P]dCTP, digoxigenin-dUTP, or biotin-dATP. Probes of more than 1.5 kb are 
generally most easily amplified by transfecting a cell with a plasmid containing the 
relevant probe, growing the transfected cell into large quantities, and purifying the 
relevant sequence from the transfected cells. (See Sambrook et al. (supra)). 

Probes can be labeled by a variety of markers, including for example, 
radioactive markers, fluorescent markers, enzymatic markers, and chromogenic 
markers. The use of **2p i s particularly preferred for marking or labeling a particular 
probe. 

It is a feature of this aspect of the invention that the probes can be 
utilized to detect the presence of 5. pneumoniae and S. pyogenes Hsp60 mRNA or DNA 
within a sample. However, if the bacteria are present in only a limited number, then it 



WO 99/35270 



PCT/CA98/01203 



33 

may be beneficial to amplify the relevant sequence such that it may be more readily 
detected or obtained. 

A variety of methods may be utilized in order to amplify a selected 
sequence, including, for example, RNA amplification {see Lizardi et al., 

5 Bio/Technology 6:1197-1202, 1988; Kramer et ah, Nature 339:401-402, 1989; Lomeli 
etal., Clinical Chem. 35(9): 1826-1 831, 1989; U.S. Patent No. 4,786,600), and DNA 
amplification utilizing LCR or Polymerase Chain Reaction ("PCR") {see U.S. Patent 
Nos. 4,683,195, 4,683,202, and 4,800,159; see also U.S. Patent Nos. 4,876,187 and 
5,011,769, which describe an alternative detection/amplification system comprising the 

10 use of scissile linkages), or other nucleic acid amplification procedures that are well 
within the level of ordinary skill in the art. With respect to PCR, for example, the 
method may be modified as known in the art. PCR may also be used in combination 
with reverse dot blot hybridization (Iida et al., FEMS Microbiol Lett 114:161-172, 
1993). PCR products may be quantitatively analyzed by incorporation of dUTP 

15 (Duplaa et al., Anal. Biochem. 272:229-236, 1993), and samples may be filter sampled 
for PCR-gene probe detection (Bej et aL, Appl Environ, Microbiol 57:3529-3534, 
1991). 

Within a preferred embodiment, PCR amplification is utilized to detect 
S. pneumoniae and 5. pyogenes Hsp60 DNA. Briefly a DNA sample is denatured at 95° 

20 C in order to generate single-stranded DNA. Specific primers are then annealed to the 
single-stranded DNA at 37°C to 70°C, depending on the proportion of AT/GC in the 
primers. The primers are extended at 72°C with Taq DNA polymerase in order to 
generate the opposite strand to the template. These steps constitute one cycle, which 
may be repeated in order to amplify the selected sequence. 

25 Within an alternative preferred embodiment, LCR amplification is 

utilized for amplification. LCR primers are synthesized such that the 5 f base of the 
upstream primer is capable of hybridizing to a unique base pair in a desired gene to 
specifically detect a strain of Streptococcus harboring the desired gene. 

Within another preferred embodiment, the probes are used in an 

30 automated, non-isotopic strategy wherein target nucleic acid sequences are amplified by 
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PCR, and then desired products are determined by a colorimetric oligonucleotide 
ligation assay (OLA) (Nickerson et al., Proc. Natl Acad. Set USA <S7:8923-8927, 
1990). 

Primers for the amplification of a selected sequence should be selected 
5 from sequences that are highly specific and form stable duplexes with the target 
sequence. The primers should also be non-complementary, especially at the 3* end, 
should not form dimers with themselves or other primers, and should not form 
secondary structures or duplexes with other regions of DNA, In general, primers of 
about 18 to 20 nucleotides are preferred, and can be easily synthesized using techniques 

10 well known in the art. PCR products, and other nucleic acid amplification products, 
may be quantitated using techniques known in the art (Duplaa et al., Anal Biochem. 
2/2:229-236, 1993; Higuchi et al., Bio/Technology 77:1026-1030). 

Further a biochip array specific for Streptococcus, comprised of a 
substrate to which either oligonucleotides or polypeptides may be bound can be 

1 5 manufactured using the invention disclosed herein in combination with current biochip 
technologies. U.S. Patent No. 5,445,934. By using such a substrate with 
oligonucleotides derived from the Streptococcal Hsp60 sequences or antibodies specific 
for the Streptococcal gene products of this invention, a high throughput screening tool 
can be created to identify the specific Streptococcal strain in many samples. 

20 

E. Pharmaceutical Compositions And Methods 

Another aspect of the present invention provides compositions and 
methods comprising one or more of the above-described Streptococcal Hsp60 
polypeptides or antibodies to Streptococcal Hsp60 in combination with one or more 

25 pharmaceutical^ or physiologically acceptable carriers, adjuvants, binders or diluents. 
Such compositions can be used to elicit or enhance an immune response in a recipient 
animal, which is preferably a human being, and preferably elicits or enhances a 
protective or partially protective immunity against Streptococcus, or against an 
organism associated with an antigen fused to the Streptococcal Hsp60s of the present 

30 invention. 
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Preferably, such carriers, adjuvants, binders or diluents are nontoxic to 
recipients at the dosages and concentrations employed. Ordinarily, the preparation of 
such compositions entails combining the isolated Streptococcal Hsp60 polypeptide with 
buffers, antioxidants such as ascorbic acid, low molecular weight (less than about 10 
5 residues) polypeptides, proteins, amino acids, carbohydrates including glucose, sucrose 
or dextrins, chelating agents such as EDTA, glutathione and other stabilizers and 
excipients. Neutral buffered saline or saline mixed with nonspecific serum albumin are 
exemplary appropriate diluents. Examples of adjuvants include alum or aluminum 
hydroxide for humans. 

10 It will be evident in light of the present specification to those in the art 

that the amount and frequency of administration can be optimized in clinical trials, and 
will depend upon such factors as the disease or disorder to be treated, the degree of 
immune inducement, enhancement, or protection required, and many other factors. 

In one embodiment, the composition is administered orally, and the 

15 purified Streptococcal Hsp60 is taken up by cells, such as cells located in the lumen of 
the gut. Alternatively, the Streptococcal Hsp60 composition can be parenterally 
administrated via the subcutaneous route, or via other parenteral routes. Other routes 
include buccal/sublingual, rectal, nasal, topical (such as transdermal and ophthalmic), 
vaginal, pulmonary, intraarterial, intramuscular, intraperitoneal, intraocular, intranasal 

20 or intravenous, or indirectly. The Streptococcal Hsp60 compositions of the present 
invention can be prepared and administered as a liquid solution, or prepared as a solid 
form (e.g., lyophilized) which can be administered in solid form or resuspended in a 
solution in conjunction with administration. 

Depending upon the application, quantities of injected Streptococcal 

25 Hsp60 in the composition will vary generally from about 0.1 \xg to 1000 mg, typically 
from about 1 jig to 100 mg, preferably from about 10 jig to 10 mg, and preferably from 
about 100 |ig to 1 mg, in combination with the physiologically acceptable carrier, 
binder or diluent. Booster immunizations can be given from 2-6 weeks later. 

The pharmaceutical compositions of the present invention may be placed 

30 within containers, along with packaging material, preferably consumer-acceptable, 
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which provides instructions regarding the use of such pharmaceutical compositions, to 
provide kits suitable for use within the present invention. Generally, such instructions 
will include a tangible expression describing the reagent concentration, as well as 
within certain embodiments, relative amounts of excipient ingredients or diluents {e.g., 
5 water, saline or PBS) which may be necessary to reconstitute the pharmaceutical 
composition. 

The Hsp gene products of this invention may also be used as 
immunological carriers in conjugate vaccines. Hsps are beneficial carriers of antigens 
because, unlike other carriers, they do not have an immunosuppressive effect. See 

10 Barrios et al., Eur. J. Immunol 22:1365-1372, 1992; Suzue and Young, in Stress- 
Inducible Cellular Responses 77:451-465, 1996 (edited by U. Feige et al.). Such 
carriers may be used to elicit an increased immune response to the conjugated molecule. 
The Streptococcal Hsp gene products of this invention may therefore be used as carriers 
(in conjugates or fusion proteins). 

15 An additional aspect of the present invention is the use of the 

Streptococcal Hsp60 genes and gene products to treat and/or prevent tumors. The 
methods comprise administering to an individual having cancer a composition 
comprising a purified Streptococcal Hsp60 gene product as discussed herein in an 
amount effective to elicit and/or enhance the immune response of an individual against 

20 the cancer. The present invention also provides a method of immunizing an individual 
against cancer, or of providing at least a partially effective immunoprotective response 
in such an individual, the method comprising administering to the individual a 
composition comprising a purified Streptococcal Hsp60 as discussed herein in an 
amount effective to immunize the individual. 

25 Preferably, the treatment of cancer comprises the use of highly purified 

Streptococcal Hsp60 gene products that are substantially free of endotoxins and 
methods and compositions related to the same. Such highly purified proteins are 
particularly advantageous, for example, for the treatment of human cancers because 
they do not incur the adverse side effects associated with such endotoxins. In particular, 

30 the compositions are capable of inducing an immune response against a cancer existing 
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within an individual, which includes both eliciting the immune response or enhancing 
the immune response against the cancer. For example, the cancer to be treated may be 
an endothelial cell cancer, such as a sarcoma and/or breast, ovarian, prostate, lung, 
pancreas and liver cancers. The present invention also provides compositions that are 
5 capable of providing either partially or folly protective immune responses by 
immunization against cancers that are not yet present within an individual. 

A further aspect of the present invention is protection from a variety of 
bacterial diseases by either immunization with the Hsp60 gene products of the present 
invention or by using gene transfer techniques to deliver a vector containing 
10 Streptococcal Hsp60 genes or fragments thereof to be expressed within the cells of the 
animal. The compositions and methods of the present invention can also provide for 
cancer prevention. 

The compositions and methodologies described herein are suitable for a 
variety of uses. To this end, the following examples are presented for purposes of 
1 5 illustration, not limitation. 

EXAMPLES 
EXAMPLE 1 

20 ISOLATION OF GENES FOR STREPTOCOCCUS PNEUMONIAE AND 

STREPTOCOCCUS PYOGENES HSPS 

Genomic DNA from Streptococcus pneumoniae (ATCC6314) and 
Streptococcus pyogenes (ATCC 12344), prepared by a routine method, wasobtained 
25 from Dr. Lee Weber, University of Nevada at Reno. 

Hsp60 DNA sequences were isolated by use of the polymerase chain 
reaction. Primers were designed based on N- and C-terminal homology of known 
Hsp60 sequences from other organisms. DNA amplifications of Streptococcal DNA 
were carried out using Taq polymerase (Perkin-Elmer). About 20 different primer pairs 
30 were tested using different reaction conditions. One pair (pair 1) was identified that was 
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capable of amplifying Hsp60-1 genes, and a second (pair 2) that permitted amplification 
of Hsp60-2 sequences. Reaction mixtures capable of amplifying Hsp60 sequences 
contained, in a total volume of 50 \i\ 9 0.5 |xg of genomic DNA, 50 pmoles of each of a 
pair of degenerate primers, 500 yM each of dNTPs, lxPCR buffer (Perkin-Elmer), 2 
5 mM MgS0 4 , and 1.25 units of Taq polymerase (Perkin-Elmer). The following two 
pairs of degenerate primers were employed successfully: 

Pairl: 

forward primer #1F: 5 '-CATATGGCNGCNAAAGAYGTAAAA-S * (SEQ ID NO:35) 
10 reverse primer #1R: 5 , -TGATCACATCATNCCNCCCATNCC-3 , (SEQ ID NO:36) 

Pair 2: 

forward primer #2F: 5 '-CATATGGCAAAAGAAATHAARTTY-3 * (SEQIDNO:37) 
reverse primer #2R: 5'-TGATCANCCNCCCATNCCNCCCAT-3' (SEQIDNO:38) 

15 

In the above sequences, N refers to A, C, G or T, and H to A, C or T (not 

G). 

Reactions were cycled 35 times at 94°C for 1 minute, 50°C for 2 minutes 
and 72°C for 2 minutes. PCR products were electrophoresed on 0.6% low-melting point 

20 agarose gels (Gibco-BRL) along with molecular weight markers. After staining with 
ethidium bromide, DNA fragments were visualized under low-intensity, long- 
wavelength UV illumination, and fragments of about 1.6 kbp were excised. DNA was 
isolated from gel slices by phenol extraction and ethanol precipitation (Maniatis et al.). 
Purified fragments were ligated to pCRII TA cloning vector (Invitrogen), and ligation 

25 mixtures were used to transform E. coli strain DH5a. (Competent cells obtained from 
Life Technologies.) Recombinant plasmids were isolated from kanamycin-resistant 
colonies by a standard alkaline lysis method, and the presence in plasmids of DNA 
inserts was verified by digestion with EcoRI digestion followed by agarose gel 
electrophoresis and visualization of digestion products by staining with ethidium 

30 bromide. 
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EXAMPLE 2 

NUCLEOTIDE SEQUENCE ANALYSIS OF STREPTOCOCCAL HSP60 

5 Inserts present in recombinant pCRH-based clones were sequenced using 

a CircumVent sequencing kit (New England Biolabs), 35 S-dATP and primers listed 
below. Multiple clones containing particular Streptococcal Hsp60 genes were 
sequenced: sequences were obtained from five clones, derived from three independent 
PCR reactions, of the Streptococcus pneumoniae Hsp60-1 gene, two clones, derived 

10 from single PCR reactions, of the Streptococcus pneumoniae Hsp60-2 gene, four 
clones, derived from three independent PCR reactions, of the Streptococcus pyogenes 
Hsp60-1 gene, and two clones and a portion of a third clone, derived from single PCR 
reaction, of the Streptococcus pyogenes Hsp60-2 gene. Sequencing reactions were 
fractionated on denaturing 6% polyacrylamide-8M urea gels (60 cm length), and the 

15 gels were dried and exposed for autoradiography. Autoradiographs were read manually, 
and sequence data were assembled and compared to other known Hsp60 genes using 
DNA Strider software (CEA, France). 



20 



Sequencing primers used: 



M13F : 5 ' -GTAAAACGACGGCC AG-3 ' (SEQ ID NO:39) 
M13R : 5 ' -C AGG AAAC AGCT ATGAC-3 ' (SEQ ID NO:40) 
W178 : 5'-CCAACCATCACGAAAGA-3' (SEQ ID NO:41) 
W179 : 5 ' - ACGGGTC ACTTTGGTTG-3 ' (SEQ ID NO:42) 

25 W189 : 5'-TTACTAATGACGGGGTA-3' (SEQIDNO:43) 
Wl 90 : 5 '-TTACCAATGACGGTGTG-3 * (SEQ ID NO:44) 
W191 : 5 ' - ACAGGGTCAATGATTCC-3 ' (SEQ ID NO:45) 
W192 : 5'-ACTGGATCAATGATACC-3' (SEQIDNO:46) 
Wl 95 : 5 ' -CCGT ACCGTGCTCTGAC-3 ' (SEQ ID NO:47) 

30 W196 : 5'-ACCACGTTTCAGATCCA-3' (SEQIDNO:48) 
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W197 : 5'-GACAGTTTCGCGGCAAC-3' (SEQ ID NO:49) 
W198 : 5 ' -CTC AG AACG AAG ATC AG-3 ' (SEQ ID NO:50) 
W200 : 5 '-GGTATGC AGTTCGACCG-3 ' (SEQ ID NO:5 1) 
W201 : 5 '-CCGTGTTGGTC AAATCC-3 ' (SEQ ID NO:52) 
5 W202 : 5'-GGTAACTACGGTTACAA-3' (SEQIDNO:53) 
W203 : S'-GAGGCCACTTCTTTCAC-S' (SEQ ID NO:54) 
W204 : 5'-GGCTTCCAGCACTGGCA-3' (SEQ ID NO:55) 
W205 : 5'-AACTTCAGTCGCAGCAC-3' (SEQ ID NO:56) 
W206 : 5 ' -CCTTGAAAGCC ATTGCT-3 ' (SEQ ID NO:57) 

10 W207 : 5'-GCTACACGTGCAGCCGT-3' (SEQ ID NO:58) 
W208 : 5 '-GCTGCAACAGGTGAGTG-3 ' (SEQ ID NO:59) 
W209 : 5 ' -TC ATGAACAATGGCTTG-3 ' (SEQ ID NO:60) 
W210 : 5 ' - ACG AAGC AC AATGTT AC-3 ' (SEQ ID NO:61) 
W21 1 : 5 ' - ATC ACTAAAGATGGTGT-3 ' (SEQ ID NO:62) 

15 W214 : S'-GCAGTTGCCGCAGCAGT-S' (SEQIDNO:63) 
W215 : 5 ' -GCT ACTCGTGC AGCTGT-3 ' (SEQ ID NO:64) 
W216 : 5 ' -GTTCTCCGTGCTTTGGA-3 ' (SEQ ID NO:65) 
W217 : 5 ' -GC ACCTGCTGTGACGTT-3 ' (SEQ ID NO:66) 
W21 8 : 5 ' -TCTTCGATGGTG ATGAC-3 ' (SEQ ID NO:67) 

20 W219 : 5'-GGCAAGAGCTGTTCCGC-3' (SEQIDNO:68) 
W220 : 5 '-CTGAGCC AGTACGGTTG-3 ' (SEQ ID NO:69) 
W221 : 5 ' -GTACTGC AGAGCGGAAC-3 ' (SEQ ID NO:70) 
W224 : 5 ' - ACCGTCTTC AACGGTG A-3 ' (SEQ ID NO:71) 
W225 : 5 ' -GTT ATCATTGCTGAAG A-3 ' (SEQ ID NO:72) 

25 W226 : 5'-ACGGTACCGCCGGTCAG-3' (SEQIDNO:73) 
W227 : 5 ' -CTGGGCC AGGCTAAACG-3 ' (SEQ ID NO:74) 
W228 : 5 ' -CGACTGAAGTTG AAATG-3 ' (SEQ ID NO:75) 
W229 : 5 ' -GCTGTTG AAG AACTGAA-3 ' (SEQ ID NO:76) 
W230 : 5 ' -GTCTTC AACGGTG ATC A-3 ' (SEQ ID NO:77) 

30 W232 : 5'-TCTTCTACCGCAGCACG-3' (SEQIDNO:78) 
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W233 : 5'-CTCTTGATTATTGCGGA-3' (SEQIDNO:79) 

W234 : 5 '-TTGTTC AAAAC AAGAGT-3 ' (SEQ ID NO:80) 

W235 : 5'-CGATTATTGTAGAAGGT-3' (SEQ ID NO:81) 

W236 : 5'-CTTGATAACCGCAACAC-3' (SEQIDNO:82) 
5 W237 : 5'-TCCAAAGCACGGAGAAC-3' (SEQIDNO:83) 

W238 : 5 ' -GTGTC AAAC ATCCAAGA-3 ' (SEQ ID NO:84) 

W239 : 5 '-TCTTCGATGGT AATCAC-3 ' (SEQ ID NO:85) 

W240 : 5 '-GC AATAATGAGTAATGG-3 ' (SEQ ID NO:86) 

W241 : 5'-ACAGTAATTGTTGAAGG-3' (SEQIDNO:87) 
10 W242 : 5'-CAGTGCAATACGGTTAG-3' (SEQIDNO:88) 

W243 : 5'-AGCTTCCAGAACCGGCA-3' (SEQIDNO:89) 

W244 : 5 ' -CTGATC ATCGCTGAAGA-3 ' (SEQ ID NO:90) 

W245 : 5 ' - ACGGTTATTGTAGAAG-3 ' (SEQ ID NO:91) 

15 The sequencing strategy for each of the Hsp60 genes is set forth in 

Figure 5. The nucleotide sequences of the Streptococcus pneumoniae Hsp60-1 gene 
(referred to as P60-1), the Streptococcus pneumoniae Hsp60-2 gene (P60-2), the 
Streptococcus pyogenes Hsp60-1 gene (Y60-1) and the Streptococcus pyogenes Hsp60- 
2 gene (Y60-2), and the corresponding deduced amino acid sequences, are set forth in 

20 Figures 1 -4 (SEQ ID NOS: 1 -8). 

Comparisons of Streptococcal Hsp60 proteins and mycobacterial Hsp65 
and GroEL proteins were determined using the MegAlign module of a DNA Star 
software package (DNASTAR, Inc.), and sequence similarities to Genbank-listed genes 
and proteins were uncovered using the BLAST algorithm (National Center for 

25 Biotechnology Information, NIH, Bethesda, MD). One comparison of such sequences 
is set forth in Figure 1 0. 
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EXAMPLE 3 

EXPRESSION OF RECOMBINANT STREPTOCOCCAL HSP60 

Inserts (Hsp60 genes) were excised from recombinant pCRII-based 
5 plasmids with restriction enzymes Ndel and EcoRL Ndel cut inside forward PCR 
primers #1F or #2F, and EcoRI cut a short distance downstream from reverse PCR 
primers #1R or #2R in the polylinker region of vector PCRII. DNA fragments 
including Hsp60 gene sequences were fractionated on low-melting-point agarose gels, 
purified from the gels and ligated into Ndel/EcoRI double-digested pET28a(+) vector 

10 DNA (Novagen). Ligation reactions were used to transform competent Escherichia coli 
DH5a cells, and transformants were selected on Luria Broth plates containing 30jag/ml 
of kanamycin D. DNA was isolated from single colonies using a standard alkaline lysis 
method, and the presence of correct inserts verified by digestion with Ndel and EcoRI 
and agarose gel electrophoresis. The resulting expression plasmids contained either a 

15 Streptococcus pneumoniae Hsp60-1 gene (referred to as pETP60-l), a Streptococcus 
pneumoniae Hsp60-2 gene (pETP60-2), a Streptococcus pyogenes Hsp60-1 gene 
(pETY60-l) or a Streptococcus pyogenes Hsp60-2 gene (pETY60-2). Schematic maps 
of the expression plasmids are shown in Figures 6-9. 

To test whether they were capable of expressing the inserted 

20 Streptococcal Hsp60 genes, the expression plasmids were introduced into Escherichia 
coli strain BL21(DE3) by electroporation, and transformant colonies were selected on 
kanamycin-containing plates as before. Cultures of one ml were inoculated with single 
colonies, and transformants were grown at 37°C, until the cultures were turbid. After 
removing an aliquot for analysis of proteins prior to induction of recombinant genes 

25 (uninduced cultures), isopropyl-thio-galactopyranoside (IPTG) was added to lmM, and 
cultures were incubated for an additional one or two hours (induced cultures). Aliquots 
of lOOjal of induced and uninduced cultures were centrifuged at 12,000 x g for 30 
seconds. Bacterial pellets were lysed in lOOpl of SDS-PAGE loading buffer and boiled 
for 3 minutes. Aliquots of lOjxl of lysates were analyzed by 10% SDS-PAGE. 

30 Recombinant Streptococcal Hsp60 proteins were detectable after Coomasie blue 
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staining as prominent bands migrating with an apparent molecular weight of about 
60kDa, which bands were present in induced but not in uninduced samples. 

EXAMPLE 4 

5 PURIFICATION OF RECOMBINANT STREPTOCOCCAL HSP60 

Bacteria containing recombinant Streptococcal Hsp60 expression 
plasmids were grown in 2xYT medium (20 g Tryptone, 10 g yeast extract, 10 g NaCl 
per liter) supplemented with 30|j,g/ml of kanamycin D at 37°C to an optical density at 
10 600 nm of 0.5-0.8 and then induced with 0.5 mM IPTG for 3 hours. Cultures were then 
chilled on ice, and bacteria collected by centrifiigation at 7,000 x g for 5 min (at 4°C). 
Bacterial pellets were frozen at -80°C. 

Frozen bacterial pellet was crushed, transferred to a blender and 
homogenized in 200ml of buffer A (6 M guanidinium hydrochloride, 50 mM Tris-HCl 
15 pH 7.5, 0.5 mM beta-mercaptoethanol). 

Lysate was cleared by centrifiigation at 10,000 x g for 15 min (at 4°C). 
The supernatant solution was mixed overnight at room temperature with approximately 
100ml of slurry containing 50ml of Ni-Sepharose (Chelating Sepharose, Pharmacia) 
equilibrated in buffer A. The resin was then washed on filter paper with approximately 
20 200 ml buffer A, resuspended in small volume of the same buffer and gravity-packed 
into glass chromatography column (Pharmacia). 

The column was washed with 20 ml of buffer A with 1% Triton X-100. 
The column was further washed with a 6 - 0 M guanidinium hydrochloride / 0 - 1 M 
NaCl gradient in 50 mM Tris-HCl pH 7.5, 0.5 mM beta-mercaptoethanol (200ml), then 
25 with 200 ml of 50 mM Tris-HCl pH 7.5, 1 M NaCl, 0.5 mM beta-mercaptoethanol, and 
finally with 200 ml of 50 mM imidazole, 0.5 M NaCl, 50 mM Tris-HCl pH 7.5, 1.025 
M NaCl, 0.5 mM beta-mercaptoethanol. Then column was developed with a 200ml- 
gradient from 5% to 100% of buffer composed of 1 M imidazole, 0.5 M NaCl, 50 mM 
Tris-HCl pH 7.5, 0.5 mM beta-mercaptoethanol in 1M NaCl, 50 mM Tris-HCl pH 7.5, 
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0.5 mM beta-mercaptoethanol. Fractions of 9ml were collected. The flow rate was 4-5 
ml/min, and chromatography was monitored by absorbance at 280 nm. 

Fractions containing the highest concentrations of recombinant protein 
were identified by 10% SDS-PAGE as before, pooled (usually 5-6 fractions) into a 
5 dialysis bag (12 kDa cutoff). Protein solution (approximately 50 ml) was then dialysed 
in the cold against three changes of 3 liters of Dulbeccos* phosphate-buffered saline 
(2.7 mM KH 2 P0 4 , 4.3 mM Na^PO^ 2.7 mM KC1, 0.137 M NaCl). Dialysed protein 
was aliquoted and stored at -80°C. Usually 200-400 mg of recombinant protein were 
obtained (estimated by protein assay according to Lowry). 

10 

EXAMPLE 5 

CHARACTERIZATION OF PURIFIED, RECOMBINANT HSP60 



To unambiguously identify recombinant proteins as Streptococcal 
15 Hsp60, purified recombinant proteins were subjected to 

N- and C-terminal sequencing (conducted by the Protein Chemistry 
Facility, W. Alton Jones Cell Science Center, Lake Placid, NY). These determinations 
revealed that purified recombinant proteins had the C- and N-terminal sequences 
predicted from the deducted amino acid sequences of SEQ ID NOS:5-8 (except for the 
20 N-terminal methionine that is typically processed away in E. coli bacteria). 

EXAMPLE 0 

REACTIVITY OF RECOMBINANT STREPTOCOCCAL HSP60 WITH KNOWN 
ANTI-HSP60 MONOCLONAL ANTIBODIES 

25 

Purified recombinant Streptococcal Hsp60 proteins were analyzed for 
reactivity with the following commercially available antibodies: 

A) Rabbit polyclonal antibody SPA-804 (StressGen Biotechnologies) 
which was raised against Synechococcus sp. Hsp60. The antibody 
30 recognizes Hsp60 from a wide range of prokaryotes and eukaryotes 
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including cyanobacteria, Escherichia coli, and primate, murine, 
hamster, and rat cell lines. 

B) Murine monoclonal antibody SPA-807 (StressGen Biotechnologies) 
which was raised against human Hsp60. Its epitope is located 

5 between residues 383-419 of that protein. The antibody also cross- 

reacts with Hsp60 from various other species including primates, 
rabbit, mouse, rat, hamster, Borrelia sp., Escherichia coli, 
Streptococcus pyogenes, Yersinia enterocolitica, Salmonella 
typhimurium, Treponema hyodysenteriae, Treponema innocense, 
10 Trichinella spiralis, yeast, and spinach chloroplasts. 

C) Murine monoclonal antibody SPA-870 (StressGen Biotechnologies) 
which was raised against Escherichia coli GroEL. The antibody 
does not recognize eukaryotic Hsp60 proteins. 

D) Murine polyclonal antibody which was raised against 
15* Mycobacterium tuberculosis BCG Hsp60 (StressGen 

Biotechnologies). The antibody does not cross-react with 
Escherichia coli groEL or eukaryotic Hsp60. 

E) Murine monoclonal antibody recognizing recombinant histidine tag 
(Qiagen). 

20 

Samples containing 0.1 jxg, 0.5|ig or Ijag of recombinant protein were 
fractionated on 10% SDS-PAGE, and proteins were electroblotted onto nitrocellulose. 
Blots for analysis with antibodies SPA-804, SPA-807, SPA-870, and anti-BCG Hsp60 
were blocked with 5% skim milk in PBS containing 0.05% Tween 20 overnight at room 

25 temperature. Blots were then incubated for one hour in the same buffer containing 
primary antibody (at a 1:1000 dilution except for anti-BCG Hsp60 antibody which was 
used at a 1:500 dilution). Blots were washed 3 times (10 min each) with PBS with 
0.05% Tween 20 and incubated for an additional hour in PBS with 5% skim milk, 
0.05% Tween 20 and goat anti-rabbit IgG - alkaline phosphatase (AP) conjugate 

30 (Sigma) or goat-anti-murine IgG - alkaline phosphatase (AP) conjugate (Sigma) (at 
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1 : 1000 dilutions), respectively. After 3 washes in PBS with 0.05% Tween 20 as before, 
blots were soaked in alkaline phosphatase reaction buffer (100 mM Tris-HCl (pH 9.5), 
150 mM NaCl, 10 mM MgCl 2 ) and then developed in 0.05% nitroblue tetrazolium 
(NBT), 0.05% 5-bromo-4-chloro-3-indolyl phosphate (BCIP) in the same buffer, until 
5 signals were clearly visible (approximately 15 minutes). 

A similar procedure was followed for anti-histidine tag antibody, except 
that blocking was in 3% bovine serum albumin in TBS (10 mM Tris-HCl, pH 7.5, 150 
mM NaCl). Primary and secondary antibodies were diluted in TBS alone, and 
incubation with primary antibody (1:500 dilution) was for two hours. Washes were 

10 performed as follows: blots were first washed twice for 10 min in TBS containing 
0.05% Tween 20 and 0.2% Triton X-100, and once for 10 min in TBS. 

Recombinant histidine-tagged Hsp60 proteins were purified from 
overexpressing E. coli cells and probed on Western blot with polyclonal antibodies 
SPA-804 and anti-BCG Hsp60 as well as monoclonal antibodies SPA-870, SPA-807, 

15 and anti-histidine tag antibody. As is shown in Table 1, SPA-804 recognized all four 
Streptococcal Hsp60 proteins. In contrast, SPA-807 failed to crossreact with 
Streptococcus pneumoniae Hsp60-2, SPA-870 was unable to react with any 
Streptococcal Hsp60-2 protein, and anti-BCG Hsp60 failed to crossreact with any 
Streptococcal Hsp60. As predicted, anti-His tag antibody reacted with all recombinant 

20 proteins which had been expressed as His-tagged proteins. Positive reactivity is 
indicated as while lack of it is marked with 

TABLE 1 

25 Recognition of Streptococcal Hsp60 proteins by anti-Hsp60 antibodies 



Antibody 


S. pneumoniae 
Hsp60-1 


S. pneumoniae 
Hsp60-2 


5. pyogenes 
Hsp60-1 


S. pyogenes 
Hsp60-2 


SPA-804 


+ 


+ 


+ 


+ 


SPA-807 


+ 




+ 




SPA-870 


+ 




+ 




anti-BCG60 










anti-His tag 


+ 


+ 


+ 


+ 
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These data demonstrate that Streptococcal Hsp60 are antigenically 
distinct from Hsp60 of other organisms. They also show that Streptococcal Hsp60-1 
and Hsp60-2 can be distinguished. And, they provide evidence that related Hsp60s 
5 from two different Streptococcal species can be recognized differentially by an 
antibody. 

EXAMPLE 7 

PREPARATION AND IDENTIFICATION OF PEPTIDE FRAGMENTS OF 
RECOMBINANT STREPTOCOCCAL HSP60 

10 

Purified recombinant proteins (50 rag at 1 mg/ml) were digested with 2.5 
mg of Lys-C endopeptidase (Boehringer Mannheim) for 1 hour at 37°C. Digestion 
reactions were fractionated by capillary electrophoresis (3D-HPCE instrument, Hewlett- 
Packard). Reactions were run at 15 kV through a 75 u bare fused silica capillary in 50 

15 mM dibasic sodium phosphate (pH 7.47). Alternatively, reverse phase chromatography 
(1100 Series HPLC instrument, Hewlett-Packard) was carried out on a Hamilton PRP-1 
5 m column developed in a 0-60% acetonitrile gradient in water in the presence of 0.1% 
trifluoroacetic acid. Individual RP-HPLC-separated peptides of Hsp60 proteins were 
identified by mass spectroscopy by Hewlett-Packard Laboratories, Palo Alto, 

20 California. RP-HPLC chromatograms of digests of recombinant Streptoccocal Hsp60s 
are shown in Figure 11. 

EXAMPLE 8 

IDENTIFICATION OF ENDOGENOUS STREPTOCOCCAL HSP60 

25 

Total protein extracts from Streptococcus pneumoniae (ATCC6314) and 
Streptococcus pyogenes (ATCC 12344) were obtained from Dr. Lee Weber (University 
of Nevada, Reno). Equivalent amounts of both extracts (equalized based on intensity of 
staining of protein bands in SDS-PAGE gels) were fractionated by 10% SDS-PAGE 
30 alongside 50 ng of purified BGC Hsp60 (StressGen Biotechnologies). After 
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electrotransfer onto nitrocellulose, filters were blocked, probed with antibody SPA-804, 
and antibody signals detected as described in Example 6. 

Other, similarly prepared filters were incubated with a 1:3000 dilutions 
of antibodies SPA-807 or SPA-870 for one hour. Blots were rinsed twice with water, 
5 washed 3 times (5 min each) with PBS containing 0.05% Tween 20 and then incubated 
for an additional hour in PBS containing 5% skim milk, 0,05% Tween 20 and a 1:3000 
dilution of goat anti-rabbit IgG - horseradish peroxidase (HRP) conjugate (Sigma). 
Subsequently, filters were rinsed with water, washed with PBS containing 0.05% 
Tween 20 as before, equilibrated in ECL substrate mixture (Amersham), wrapped in 

1 0 plastic wrap and exposed to X-ray film for between 1 5 seconds and 20 minutes. 

The results from these experiments are summarized in Table 2. 
Antibody SPA-804 reacted strongly with both Streptococcal extracts. In contrast, 
antibody SPA-807 reacted weakly with extract from Streptococcus pneumoniae but 
strongly with extract from Streptococcus pyogenes. Finally, antibody SPA-870 reacted 

15 weakly with both Streptococcal extracts. Based on the antibody specificity determined 
in Example 6 (Table 1), it is concluded that Hsp60-2 is abundant in Streptococcal cells, 
whereas Hsp60-1 is only expressed at low levels. Presumably, Hsp60-1 is the more 
highly stress-inducible Hsp60 protein. 

20 TABLE 2 

Reactivity of selected anti-Hsp60 antibodies with protein extracts from 

& pneumoniae and s. pyogenes 



Antibody 


BCG60 control 
(50 ng) 


S, pneumoniae 
extract 


S. pyogenes 
extract 


SPA-804 


+++ 


+++ 


+++ 


SPA-807 


+++ 


+ 


+++ 


SPA-870 




+ 


+ a 



a: SPA870 detected protein with mobility different from predominant heavy 
25 band visualized in that extract with SPA-807. However, its mobility was close 

to the band detected in 5. pneumoniae extract with both SPA-870 and SPA-807 
antibodies. 

The amount of the utilized extracts was normalized by comparing 
Coomassie stained gels containing serial dilutions. 
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From the foregoing, it will be appreciated that, although specific 
embodiments of the invention have been described herein for purposes of illustration, 
various modifications may be made without deviating from the spirit and scope of the 
invention. Accordingly, the invention is not limited except as by the appended claims. 
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CLAIMS 

1. An isolated nucleic acid molecule encoding a Streptococcus 
pneumoniae Hsp60. 



2. An isolated nucleic acid molecule encoding a Streptococcus pyogenes 



Hsp60. 



3. An isolated nucleotide molecule selected from the group consisting of: 

(a) an isolated nucleic acid molecule comprising the sequence of SEQ ID 
NO:l from nucleotides 15-1652; 

(b) an isolated nucleic acid molecule comprising the sequence of SEQ ID 
NO:3 from nucleotides 15-1640; 

(c) an isolated nucleic acid molecule comprising the sequence of SEQ ID 
NO:5 from nucleotides 15-1649; 

(d) an isolated nucleic acid molecule comprising the sequence of SEQ ID 
NO:7 from nucleotides 15-1652; 

(e) an isolated nucleic acid molecule complementary to any one of the 
nucleotides of SEQ ID NOS:l, 3, 5 or 7 set forth in (a) through (d), respectively; and 

(f) an isolated nucleic acid molecule that hybridizes under conditions of 
high stringency to the nucleic acid molecules of any one of (a) through (e). 

4. An isolated nucleic acid molecule that specifically hybridizes to the 
nucleic acid molecule of any one of SEQ ID NO:l from nucleotides 15-1652, SEQ ID NO:3 
from nucleotides 15-1640, SEQ ID NO:5 from nucleotides 15-1649, or SEQ ID NO:7 from 
nucleotides 15-1652 or a complement thereof under conditions of high stringency. 



5. An isolated nucleic acid molecule comprising a nucleotide sequence 
that is identical to a segment comprising at least 25% of contiguous nucleotide bases of any 
one of SEQ ID NO:l from nucleotides 15-1652, SEQ ID NO:3 from nucleotides 15-1640, 
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SEQ ID NO:5 from nucleotides 15-1649, or SEQ ID NO:7 from nucleotides 15-1652 or a 
complement thereof. 

6. An isolated nucleic acid molecule encoding Hsp60 comprising a 
nucleic acid sequence that encodes a polypeptide comprising any one of SEQ ID NOS: 2, 4, 6 
or 8 or a variant Hsp60 that is at least 95% homologous to a polypeptide according to any one 
ofSEQIDNOS:2, 4, 6 or 8. 

7. An isolated nucleic acid molecule according to claim 3, encoding a 
polypeptide that is able to be selectively bound by an antibody specific for a Streptococcus 
pneumoniae Hsp60 or a Streptococcus pyogenes Hsp60. 

8. An isolated nucleic acid molecule encoding at least 8 amino acids of a 
Streptococcal Hsp60 polypeptide selected from amino acid residues 1-545 of SEQ ID NO:2, 
amino acid residues 1-541 of SEQ ID NO:4, amino acid residues 1-544 of SEQ ID NO:6, and 
amino acid residues 1-545 of SEQ ID NO: 8, wherein the encoded Streptococcal Hsp60 
polypeptide is able to bind to a major histocompatibility complex. 

9. An isolated Streptococcus pneumoniae Hsp60 polypeptide. 

10. An isolated Streptococcus pyogenes Hsp60 polypeptide. 

1 1 . An isolated Hsp60 polypeptide comprising the amino acid sequence of 
any one of a Streptococcal Hsp60 polypeptide selected from amino acid residues 1-545 of 
SEQ ID NO:2, amino acid residues 1-541 of SEQ ID NO:4, amino acid residues 1-544 of 
SEQ ID NO:6, and amino acid residues 1-545 of SEQ ID NO:8, or variants thereof, wherein 
the polypeptide is able to be selectively bound by an antibody specific for either a 
Streptococcus pneumoniae Hsp60 and/or Streptococcus pyogenes Hsp60. 
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12. The isolated Hsp60 polypeptide according to any one of claims 9-11, 
wherein the Hsp60 polypeptide is fused to an additional polypeptide to create a fusion 
protein. 

13. An isolated Hsp60 polypeptide comprising at least 8 amino acids 
selected from amino acid residues 1-545 of SEQ ID NO:2, amino acid residues 1-541 of SEQ 
ID NO:4, amino acid residues 1-544 of SEQ ID NO:6, and amino acid residues 1-545 of SEQ 
ID NO:8, wherein the Hsp60 polypeptide is capable of binding to a major histocompatibility 
complex and eliciting or enhancing an immune response to Streptococcus in a human being. 

14. The isolated Hsp60 polypeptide according to claim 11 wherein the 
polypeptide is derived from proteolytic cleavage. 

15. The isolated Hsp60 polypeptide according to claim 11 wherein the 
polypeptide is derived from chemical synthesis. 

16. The isolated Hsp60 according to claim 11 wherein the Hsp60 is an 
expression product of a transformed host cell containing a nucleic acid molecule encoding the 
Hsp60 or portion thereof. 

17. The isolated Hsp60 polypeptide according to claim 11 wherein the 
polypeptide comprises greater than 95% homology to any one of a Streptococcal Hsp60 
polypeptide selected from amino acid residues 1-545 of SEQ ID NO:2, amino acid residues 
1-5410 of SEQ ID NO:4, amino acid residues 1-544 of SEQ ID NO:6, and amino acid 
residues 1-545 of SEQ ID NO:8, and wherein the Hsp60 polypeptide is able to be selectively 
bound by an antibody specific for either a Streptococcus pneumoniae Hsp60 or Streptococcus 
pyogenes Hsp60 or both. 
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18. An isolated polypeptide wherein the polypeptide is an expression 
product of a transformed host cell containing the nucleic acid molecule of any one of claims 
1-8. 

19. A vector comprising an isolated nucleic acid molecule according to any 
one of claims 1-8. 

20. The vector according to claim 19 wherein the vector is an expression 
vector comprising a promoter in operative linkage with the isolated nucleic acid molecule 
encoding the Hsp60 or portion thereof. 

21. The vector according to claim 20, further comprising a selectable or 
identifiable marker. 

22. The vector according claim 20 wherein the promoter is a constitutive 
or an inducible promoter. 

23. A host cell containing a vector according to claim 19. 

24. The host cell according to claim 24 wherein the host cell is selected 
from the group consisting of a bacterial cell, a mammalian cell, a yeast cell and an insect cell. 

25. A composition comprising an Hsp60 polypeptide of any one of claims 
9-16 in combination with a pharmaceutically acceptable carrier or diluent. 

26. The composition according to claim 25 wherein the composition is 
suitable for systemic administration. 

27. The composition according to claim 25 wherein the composition is 
suitable for oral administration. 
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28. The composition according to claim 25 wherein the composition is 
suitable for parenteral administration. 

29. A method for eliciting or enhancing an immune response in a mammal 
against Streptococcus, comprising administering to the mammal an effective amount of an 
Hsp60 polypeptide according to any one of claims 9-16 in combination with a 
pharmaceutical^ acceptable carrier or diluent. 

30. A method for eliciting or enhancing an immune response in a mammal 
against a target antigen comprising administering to the mammal the target antigen joined to 
an Hsp60 polypeptide according to any one of claims 9-16 in combination with a 
pharmaceutically acceptable carrier or diluent. 

31. A composition comprising an isolated nucleic acid molecule of any 
one of claims 1-8 wherein the isolated nucleic acid molecule encodes a polypeptide having at 
least one amino acid difference from a corresponding polypeptide of an Hsp60 protein from 
an organism other than Streptococcus. 
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S. pnemnide Hsp60-1 gene CSEQ ID N0S:1 and 2) 

GAATTCGGCT TCAT AT6 GCG GCT AAA GAC GTA AAA nC GGT AAC GAC GCT CGT GTG AAA ATG CTG CGC GGC GTA AAC 77 

Met Ala Ala Lys Asp Val Lys Phe Gly Asn Asp Ala Arg Val Lys Met Leu Arg Gly Val Asn 21 

GTA CTG GCA GAT GCA GTG AAA GTT ACC CTC GGC CCA AAA GGC CGT AAC GTA GTT CTG GAT AAA TCT TTC GGT GCA 152 

Val Leu Ala Asp Ala Val Lys Val Thr Leu Gly Pro Lys Gly Arg Asn Val Val Leu Asp Lys Ser Phe Gly Ala 46 

CCG ACC ATC ACT AAA GAT GGT GTT TCC GTA GCA CGT GAA ATC GAA CTG GAA GAC AAG TTC GAA AAC ATG GGT GCG 227 

Pro Thr He Thr Lys Asp Gly Val Ser Val Ala Arg Glu He Glu Leu Glu Asp Lys Phe Glu Asn Met Gly Ala 71 

CAG ATG GTG AAA GAA GTT GCC TCT AAA GCG AAC GAC GCT GCA GGT GAC GGT ACC ACC ACC GCA ACC GTA CTG GCT 302 

Gin Met Val Lys Glu Val Ala Ser Lys Ala Asn Asp Ala Ala Gly Asp Gly Thr Thr Thr Ala Thr Val Leu Ala 96 

CAG TCC ATC ATC ACT GAA GGC CTG AAA GCC GTT GCT GCG GGC ATG AAC CCG ATG GAT CTG AAA CGT GGT ATC GAC 377 

Gin Ser He He Thr Glu Gly Leu Lys Ala Val Ala Ala Gly Met Asn Pro Met Asp Leu Lys Arg Gly He Asp 121 

AAA GCT GTC GCT GCT GCT GTT GAA GAA CTG AAA GCA CTG TCC GTA CCG TGC TCC GAC TCT AAA GCT ATT GCT CAG 452 

Lys Ala Val Ala Ala Ala Val Glu Glu Leu Lys Ala Leu Ser Val Pro Cys Ser Asp Ser Lys Ala lie Ala Gin 146 

GTT GGT ACC ATC TCC GCT AAC TCC GAC GAA ACC GTA GGT AAA CTG ATC GCT GAA GCG ATG GAC AAA GTC GGT AAA 527 

Val Gly Thr He Ser Ala Asn Ser Asp Glu Thr Val Gly Lys Leu He Ala Glu Ala Met Asp Lys Val Gly Lys 171 

GAA GGC GTG ATC ACC GTT GAA GAC GGT ACC GGT CTG CAG GAC GAA CTG GAC GTG GTT GAA GGT ATG CAG TTC GAC 602 

Glu Gly Val He Thr Val Glu Asp Gly Thr Gly Leu Gin Asp Glu Leu Asp Val Val Glu Gly Met Gin Phe Asp 196 

CGT GGC TAC CTG TCT CCT TAC TTC ATC AAC AAG CCG GAA ACT GGC GCA GTA GAA HG GAA AGC CCG TTC ATC CTG 677 
Arg Gly Tyr Leu Ser Pro Tyr Phe lie Asn Lys Pro Glu Thr Gly Ala Val Glu Leu Glu Ser Pro Phe He Leu 221 

CTG GCT GAC AAG AAA ATC TCC AAC ATC CGC GAA ATG CTG CCG GTT CTG GAA GCT GTA GCG AAA GCA GGC AAA CCG 752 
Leu Ala Asp Lys Lys He Ser Asn lie Arg Glu Met Leu Pro Val Leu Glu Ala Val Ala Lys Ala Gly Lys Pro 246 

CTG CTG ATC ATC GCT GAA GAT GTT GAA GGC GAA GCG CTG GCA ACT CTG GTT GTT AAC ACC ATG CGC GGT ATC GTA 827 
Leu Leu He He Ala Glu Asp Val Glu Gly Glu Ala Leu Ala Thr Leu Val Val Asn Thr Met Arg Gly He Val 271 

AAA GTC GCT GCG GTT AAA GCA CCT GGC TTC GGC GAT CGT CGT AAA GCA ATG CTG CAG GAT ATC GCT ACC CTG ACC 902 
Lys Val Ala Ala Val Lys Ala Pro Gly Phe Gly Asp Arg Arg Lys Ala Met Leu Gin Asp He Ala Thr Leu Thr 296 

GGT GGT ACC GTT ATC TCT GAA GAG ATC GGT ATG GAG CTG GAA AAA GCA ACT CTG GAA GAT CTG GGC CAG GCG AAA 977 
Gly Gly Thr Val He Ser Glu Glu He Gly Met Glu Leu Glu Lys Ala Thr Leu Glu Asp Leu Gly Gin Ala Lys 321 

CGC GTT GTT ATC AAC AAA GAT ACC ACC ACC ATC ATC GAT GGC GTG GGC GAC GAA GCT GCA ATC CAG GGT CGC GTG 1052 
Arg Val Val He Asn Lys Asp Thr Thr Thr He He Asp Gly Val Gly Asp Glu Ala Ala He Gin Gly Arg Val 346 
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ACT CAG ATT CGT CAG CAG ATC GAA GM GCA ACT TCC GAC TAT GAC CGT GAA AAA CTG CAG GAG CGC GTA GCG AAA 1127 

Thr Gin He Arg Gin Gin He Glu Glu Ala Thr Ser Asp Tyr Asp Arg Glu Lys Leu Gin 61 u Arg Val Ala Lys 371 

CTG GCA GGC GGC GTT GCG GTT ATC AAA GTT GGT GCT GCG ACT GAA GTT GAA ATG AAA GAG AAG AAA GCC CGC GTT 1202 

Leu Ala Gly Gly Val Ala Val He Lys Val Gly Ala Ala Thr Glu Val Glu Met Lys Glu Lys Lys Ala Arg Val 396 

GAA GAT GCC CTG CAC GCT ACC CGT GCT GCG GTA GAA GAA GGC GTG GTT GCT GGT GGT GGC GTT GCG CTG ATT CGC 1277 

Glu Asp Ala Leu His Ala Thr Arg Ala Ala Val Glu Glu Gly. Val Val Ala Gly Gly Gly Val Ala Leu He Arg 421 

GTA GCG TCT AAA ATT GCC GGC CTG AAA GGT CAG AAC GAA GAC CAG AAC GTA GGT ATC AAA GTT GCG CTG CGC GCA 1352 

Val Ala Ser Lys He Ala Gly Leu Lys Gly Gin Asn Glu Asp Gin Asn Val Gly lie Lys Val Ala Leu Arg Ala 446 

ATG GAA TCC CCA CTG CGT CAA ATC GTA CTG AAC TGC GGC GAA GAG CCG TCT GTA GTG GCT AAC ACC GTG AAA GCC 1427 

Met Glu Ser Pro Leu Arg Gin He Val Leu Asn Cys Gly Glu Glu Pro Ser Val Val Ala Asn Thr Val Lys Ala 471 

GGT GAC GGT AAC TAC GGT TAC AAC GCT GCA ACT GAA GAA TAC GGC AAC ATG ATC GAC ATG GGT ATC CTG GAT CCA 1502 

Gly Asp Gly Asn Tyr Gly Tyr Asn Ala Ala Thr Glu Glu Tyr Gly Asn Met He Asp Met Gly He Leu Asp Pro 496 

ACC AAA GTA ACT CGT TCT GCT CTG CAG TAC GCG GCT TCT GTT GCG GGT CTG ATG ATC ACC ACC GAG TGC ATG GTT 1577 

Thr Lys Val Thr Arg Ser Ala Leu Gin Tyr Ala Ala Ser Val Ala Gly Leu Met He Thr Thr Glu Cys Met Val 521 

ACC GAC CTG CCG AAA GGC GAT GCA CCT GAC TTA GGT GCT GCT GGT GGT ATG GGC GGC ATG GGC GGA ATG ATG TGA 1652 

Thr Asp Leu Pro Lys Gly Asp Ala Pro Asp Leu Gly Ala Ala Gly Gly Met Gly Gly Met Gly Gly Met Met * 546 

TCAAGCC GAATTC 1663 
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S. pneumoniae Hsp60-2 gene (SEQ ID N0S:3 and 4) 

GMTTCGGCT TCAT ATG GCA AAA GAA ATT AAA TTT TCA TCA GAT GCC CGT TCA GCT ATG GTC CGT GGT GTC GAT ATC 77 

Met Ala Lys 61 u He Lys Phe Ser Ser Asp Ala Arg Ser Ala Met Val Arg Gly Val Asp He 21 

CTT GCA GAT ACT GTT AAA GTA ACT TTG GGA CCA AAA GGT CGC AAT GTC GTT CTT GAA AAG TCA TTC GGT TCA CCC 152 

Leu Ala Asp Thr Val Lys Val Thr Leu Gly Pro Lys Gly Arg Asn Val Val Leu Glu Lys Ser Phe Gly Ser Pro 46 

TTG ATT ACC AAT GAC GGT GTG ACT ATT GCC AAA GAA An GAA TTA GAA GAC CAT TTT GAA AAT ATG GGT GCC AAA 227 

Leu lie Thr Asn Asp Gly Val Thr He Ala Lys Glu He Glu Leu Glu Asp His Phe Glu Asn Met Gly Ala Lys 71 

TTG GTA TCA GAA GTA GCT TCA AAA ACC AAT GAT ATC GCA GGT GAT GGA ACT ACA ACT GCA ACT GTT TTG ACC CAA 302 

Leu Val Ser Glu Val Ala Ser Lys Thr Asn Asp He Ala Gly Asp Gly Thr Thr Thr Ala Thr Val Leu Thr Gin 96 

GCA ATC GTC CGT GAA GGA ATC AAA AAC GTC ACA GCA GGT GCA AAT CCA ATC GGT ATT CGT CGT GGG ATT GAA ACA 377 

Ala He Val Arg Glu Gly He Lys Asn Val Thr Ala Gly Ala Asn Pro He Gly He Arg Arg Gly He Glu Thr 121 

GCA GTT GCC GCA GCA GTT GAA GCT TTG AAA AAC AAC GTC ATC CCT GTT GCC AAT AAA GAA GCT ATC GCT CAA GTT 452 

Ala Val Ala Ala Ala Val Glu Ala Leu Lys -Asn Asn Val He Pro Val Ala Asn Lys Glu Ala He Ala Gin Val 146 

GCA GCC GTA TCT TCT CGT TCT GAA AAA GTT GGT GAG TAC ATC TCT GAA GCA ATG GAA AAA GTT GGC AAA GAC GGT 527 

Ala Ala Val Ser Ser Arg Ser Glu Lys Val Gly Glu Tyr He Ser Glu Ala Met Glu Lys Val Gly Lys Asp Gly 171 

GTC ATC ACC ATC GAA GAG TCA CGT GGT ATG GAA ACA GAG CTT GAA GTC GTA GAA GGA ATG CAG TTT GAC CGT GGT 602 

Val He Thr He Glu Glu Ser Arg Gly Met Glu Thr Glu Leu Glu Val Val Glu Gly Met Gin Phe Asp Arg Gly 196 

TAC CTT TCA CAG TAC ATG GTG ACA GAT AGC GAA AAA ATG GTG GCT GAC CTT GAA AAT CCG TAC ATT TTG ATT ACA 677 

Tyr Leu Ser Gin Tyr Met Val Thr Asp Ser Glu Lys Met Val Ala Asp Leu Glu Asn Pro Tyr He Leu He Thr 221 

GAC AAG AAA ATT TCC AAT ATC CAA GAA ATC TTG CCA CTT TTG GAA AGC ATT CTC CAA AGC AAT CGT CCA CTC TTG 752 

Asp Lys Lys He Ser Asn He Gin Glu He Leu Pro Leu Leu Glu Ser He Leu Gin Ser Asn Arg Pro Leu Leu 246 

ATT ATT GCG GAT GAT GTG GAT GGT GAG GCT CTT CCA ACT CTT GTT TTG AAC AAG ATT CGT GGA ACC TTC AAC GTA B27 

He He Ala Asp Asp Val Asp Gly Glu Ala Leu Pro Thr Leu Val Leu Asn Lys He Arg Gly Thr Phe Asn Val 271 

GTA GCA GTC AAG GCA CCT GGT TTT GGT GAC CGT CGC AAA GCC ATG CTT GAA GAT ATC GCC ATC TTA ACA GGC GGA 902 
Val Ala Val Lys Ala Pro Gly Phe Gly Asp Arg Arg Lys Ala Met Leu Glu Asp lie Ala lie Leu Thr Gly Gly 296 

ACA GTT ATC ACA GAA GAC CTT GGT CTT GAG TTG AAA GAT GCG ACA ATT GAA GCT CTT GGT CAA GCA GCG AGA GTG 977 
Thr Val He Thr Glu Asp Leu Gly Leu Glu Leu Lys Asp Ala Thr He Glu Ala Leu Gly Gin Ala Ala Arg Val 321 

ACC GTG GAC AAA GAT AGC ACG GTT ATT GTA GAA GGT GCA GGA AAT CCT GAA GCG ATT TCT CAC CGT GTT GCG GTT 1052 
Thr Val Asp Lys Asp Ser Thr Val He Val Glu Gly Ala Gly Asn Pro Glu Ala He Ser His Arg Val Ala Val 346 
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ATC AAG TCT CM ATC GAA ACT ACA ACT TCT GAA TTT GAC CGT GAA AAA TTG CAA GAA CGC TTG GCC AAA TTG TCA 1127 

He Lys Ser Gin He Glu Thr Thr Thr Ser Glu Phe Asp Arg Glu Lys Leu Gin Glu Arg Leu Ala Lys Leu Ser 371 

GGT GGT GTA GCG GTT ATT AAG GTC GGA GCC GCA ACT GAA ACT GAG TTG AAA GAA ATG AAA CTC CGC ATT GAA GAT 1202 

Gly Gly Val Ala Val He Lys Val Gly Ala Ala Thr Glu Thr Glu Leu Lys Glu Met Lys Leu Arg He Glu Asp 396 

GCC CTC AAC GCT ACT CGT GCA GCT GTT GAA GAA GGT ATT GTT GCA GGT GGT GGA ACA GCT CTT GCC AAT GTG ATT 1277 

Ala Leu Asn Ala Thr Arg Ala Ala Val Glu Glu Gly He Val Ala Gly Gly Gly Thr Ala Leu Ala Asn Val He 421 

CCA GCT GTT GCT ACC TTG GAA TTG ACA GGA GAT GAA GCA ACA GGA CGT AAT ATT GTT CTC CGT GCT TTG GAA GAA 1352 

Pro Ala Val Ala Thr Leu Glu Leu Thr Gly Asp Glu Ala Thr Gly Arg Asn lie Val Leu Arg Ala Leu Glu Glu 446 

CCT GTT CGT CAA ATT GCT CAC AAT GCA GGA TTT GAA GGA TCT ATC GTT ATC GAT CGT TTG AAA AAT GCT GAG CTT 1427 

Pro Val Arg Gin He Ala His Asn Ala Gly Phe Glu Gly Ser He Val He Asp Arg Leu Lys Asn Ala Glu Leu 471 

GGT ATA GGA TTC AAC GCA GCA ACT GGC GAG TGG GTT AAC ATG An GAT CAA GGT ATC ATT GAT CCA GTT AAA GTG 1502 

Gly lie Gly Phe Asn Ala Ala Thr Gly Glu Trp Val Asn Met He Asp Gin Gly lie lie Asp Pro Val Lys Val 496 

AGT CGT TCA GCC CTA CAA AAT GCA GCA TCT GTA GCC AGC TTG ATT TTG ACA ACA GAA GCA GTC GTA GCC AAT AAA 1577 

Ser Arg Ser Ala Leu Gin Asn Ala Ala Ser Val Ala Ser Leu He Leu Thr Thr Glu Ala Val Val Ala Asn Lys 521 

CCA GAA CCA GTA GCC CCA GCT CCA GCA ATG GAT CCA AGT ATG ATG GGT GGA ATG GGC GGA TGA TCAAAGC CGAATTC 1654 

Pro Glu Pro Val Ala Pro Ala Pro Ala Met Asp Pro Ser Met Met Gly Gly Met Gly Gly * 542 
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S. pyogenes Hsp60-1 gene (SEQ ID NOS: 5 and 6) 

6AATTCGGCT TCAT ATG GCG GCT AAA GAT GTA AAA TTC GGT AAC GAC GCT CGT GTA AAA ATG CTC CGC GGC GTA AAC 77 

Met Ala Ala Lys Asp Val Lys Phe Gly Asn Asp Ala Arg Val Lys Met Leu Arg Gly Val Asn 21 

GTA CTG GCA GAC GCA GTT AAA GTA ACC CTG GGC .CCG AAA GGC CGT AAC GTA GTG CTG GAC AAA TCC TTC GGC GCG 152 

Val Leu Ala Asp Ala Val Lys Val Thr Leu Gly Pro Lys Gly Arg Asn Val Val Leu Asp Lys Ser Phe Gly Ala 46 

CCA ACC ATC ACG AAA GAT GGT GTT TCT GTA GCA CGT GAA ATC GAG CTG GAA GAC AAG TTC GAA AAC ATG GGC GCG 227 

Pro Thr He Thr Lys Asp Gly Val Ser Val Ala Arg Glu He Glu Leu Glu Asp Lys Phe Glu Asn Met Gly Ala 71 

CAG ATG GTG AAA GAA GTG GCC TCT AAA GCG AAC GAC GCT GCA GGC GAC GGT ACC ACC ACC GCG ACC GTG CTG GCT 302 
Gin Met Val Lys Glu Val Ala Ser Lys Ala Asn Asp Ala Ala Gly Asp Gly Thr Thr Thr Ala Thr Val Leu Ala 96 

CAG GCT ATC ATC ACC GAA GGT CTG AAA GCC GTT GCT GCG GGC ATG AAC CCA ATG GAT CTG AAA CGT GGT ATC GAC 377 

Gin Ala He He Thr Glu Gly Leu Lys Ala Val Ala Ala Gly Met Asn Pro Met Asp Leu Lys Arg Gly He Asp 121 

AAA GCT GTC GCG TCC GCT GTT GAA GAA CTG AAA GCG CTG TCC GTA CCG TGC TCT GAC TCT AAA GCC ATT GCT CAG 452 

Lys Ala Val Ala Ser Ala Val Glu Glu Leu Lys Ala Leu Ser Val Pro Cys Ser Asp Ser Lys Ala He Ala Gin 146 

GTA GGT ACC ATC TCC GCT AAC TCC GAC GAA ACC GTA GGT AAA CTG ATC GCG GAA GCG ATG GAT AAA GTC GGT AAA 527 

Val Gly Thr He Ser Ala Asn Ser Asp Glu Thr Val Gly Lys Leu He Ala Glu Ala Met Asp Lys Val Gly Lys 171 

GAA GGC GTG ATC ACC GTT GAA GAC GGT ACC GGT CTG GAA GAC GAA CTG GAC GTG GTT GAA GGT ATG CAG TTC GAC 602 

Glu Gly Val He Thr Val Glu Asp Gly Thr Gly Leu Glu Asp Glu Leu Asp Val Val Glu Gly Met Gin Phe Asp 196 

CGC GGT TAC CTG TCC CCA TAC TTC ATC AAC AAG CCA GAA ACT GGC GCT GTT GAG CTG GAA AGC CCG TTC ATC CTG 677 

Arg Gly Tyr Leu Ser Pro Tyr Phe He Asn Lys Pro Glu Thr Gly Ala Val Glu Leu Glu Ser Pro Phe He Leu 221 

CTG GCT GAC AAG AAA ATC TCC AAC ATC CGC GAA ATG CTG CCA GTG CTG GAA GCC GTT GCG AAA GCA GGC AAA CCG 752 

Leu Ala Asp Lys Lys He Ser Asn He Arg Glu Met Leu Pro Val Leu Glu Ala Val Ala Lys Ala Gly Lys Pro 246 

CTG GTT ATC ATT GCT GAA GAC GTT GAA GGC GAA GCG CTG GCG ACC CTG GTG GTT AAC ACC ATG CGT GGC ATC GTG 827 
Leu Val He He Ala Glu Asp Val Glu Gly Glu Ala Leu Ala Thr Leu Val Val Asn Thr Met Arg Gly He Val 271 

AAA GTG GCT GCG GTT AAA GCA CCT GGC TTC GGC GAC CGC CGT AAA GCG ATG CTG CAG GAT ATC GCT ACC CTG ACC 902 
Lys Val Ala Ala Val Lys Ala Pro Gly Phe Gly Asp Arg Arg Lys Ala Met Leu Gin Asp He Ala Thr Leu Thr 296 

GGC GGT ACC GTC ATC TCT GAA GAG ATC GGT ATG GAG CTG GAA AAA GCG ACC CTG GAA GAC CTG GGC CAG GCT AAA 977 
Gly Gly Thr Val lie Ser Glu Glu He Gly Met Glu Leu Glu Lys Ala Thr Leu Glu Asp Leu Gly Gin Ala Lys 321 

CGT GTT GTG ATC AAC AAA GAC ACC ACC ACC ATC ATC GAT GGC GTG GGC GAC GAA GCG GCG ATT CAG GGC CGT GTT 1052 
Arg Val Val lie Asn Lys Asp Thr Thr Thr He He Asp Gly Val Gly Asp Glu Ala Ala lie Gin Gly Arg Val 346 
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GGT CAG ATC CGT AAG CAG ATC GAA GAA GCC ACT TCC GAT TAC GAC CGT GAA AAA CTG CAG GAG CGC GTA GCG AAA 1127 

Gly Gin He Arg Lys Gin He Glu Glu Ala Thr Ser Asp Tyr Asp Arg Glu Lys Leu Gin Glu Arg Val Ala Lys 371 

CTG GCA GGC GGT GTT GCG GTA ATC AAA GTC GGT GCT GCG ACT GAA GTT GAA ATG AAA GAG AAA AAA GCA CGC GTT 1202 

Leu Ala Gly Gly Val Ala Val He Lys Val Gly Ala Ala Thr Glu Val Glu Met Lys Glu Lys Lys Ala Arg Val 396 

GAC GAT GCC CTG CAC GCG ACC CGT GCT GCG GTA GAA GAA GGC GTG GTT GCT GGT GGT GGT GTG GCG CTG GTG CGT 1277 

Asp Asp Ala Leu His Ala Thr Arg Ala Ala Val Glu Glu Gly Val Val Ala Gly Gly Gly Val Ala Leu Val Arg 421 

GTT GCC GCG AAA CTG TCC GGC CTG ACT GCT CAG AAC GAA GAT CAG AAC GTG GGT ATC AAA GTT GCG CTG CGC GCA 1352 

Val Ala Ala Lys Leu Ser Gly Leu Thr Ala Gin Asn Glu Asp Gin Asn Val Gly He Lys Val Ala Leu Arg Ala 446 

ATG GAA GCT CCA CTG CGT CAG ATC GTG TCC AAC GCC GGT GAA GAG CCA TCT GTT GTG ACC AAC AAC GTG AAA GCA 1427 

Met Glu Ala Pro Leu Arg Gin He Val Ser Asn Ala Gly Glu Glu Pro Ser Val Val Thr Asn Asn Val Lys Ala 471 

GGC GAA GGT AAC TAC GGT TAC AAC GCA GCA ACT GAA GAA TAC GGC AAC ATG ATC GAC TTC GGT ATC CTG GAT CCA 1502 

Gly Glu Gly Asn Tyr Gly Tyr Asn Ala Ala Thr Glu Glu Tyr Gly Asn Met He Asp Phe Gly He Leu Asp Pro 496 

ACC AAA GTG ACC CGT TCT GCT CTG CAG TAC GCG GCA TCT GTC GCT GGC CTG ATG ATC ACC ACC GAG TGC ATG GTG 1577 

Thr Lys Val Thr Arg Ser Ala Leu Gin Tyr Ala Ala Ser Val Ala Gly Leu Met He Thr Thr Glu Cys Met Val 521 

ACC GAC CTG CCT AAA GGC GAC GCA CCT GAC TTA GGT GCT GCA GGC ATG GGT GGG ATG GGC GGT ATG ATG TGA TCAA 1653 

Thr Asp Leu Pro Lys Gly Asp Ala Pro Asp Leu Gly Ala Ala Gly Met Gly Gly Met Gly Gly Met Met ' 545 

GCC GAATTC 1652 
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S. pyogenes Hsp60-2 gene (SEQ ID NOS: 7 and 8) 

GMTTCGGCT TCAT ATG 6CA AAA GAA ATC AAA TTT TCA GCA GAT GCG CGT GCT GCC ATG GTG CGC GGA GTT GAT ATG 77 

Met Ala Lys Glu He Lys Phe Ser Ala Asp Ala Arg Ala Ala Met Val Arg Gly Val Asp Met 21 

TTA GCA GAT ACC GTC AM GTA ACG CTT GGT CCT AAA GGG CGC AAT GTT GTT CTT GAA AAA GCT TTT GGT TCT CCC 152 

Leu Ala Asp Thr Val Lys Val Thr Leu Gly Pro Lys Gly Arg Asn Val Val Leu Glu Lys Ala Phe Gly Ser Pro 46 

TTA ATT ACT AAT GAC GGG GTA ACC ATT GCT AAA GAG ATC GAA TTA GAA GAT CAT TTT GAA AAC ATG GGA GCA AAA 227 

Leu He Thr Asn Asp Gly Val Thr He Ala Lys Glu He Glu Leu Glu Asp His Phe Glu Asn Met Gly Ala Lys 71 

TTG GTG TCT GAA GTG GCT TCT AAA ACC AAT GAT ATT GCT GGT GAT GGG ACG ACT ACT GCA ACA GTT TTG ACA CM 302 

Leu Val Ser Glu Val Ala Ser Lys Thr Asn Asp He Ala Gly Asp Gly Thr Thr Thr Ala Thr Val Leu Thr Gin 96 

GCC An GTT CAT GAA GGA CTA AAA AAT GTG ACA GCA GGT GCT AAT CCA ATT GGT ATC CGT CGA GGC ATT GAA ACA 377 

Ala He Val His Glu Gly Leu Lys Asn Val Thr Ala Gly Ala Asn Pro He Gly He Arg Arg Gly He Glu Thr 121 

GCA ACA GCA ACA GCT GTT GAA GCC TTG AAA GCC ATT GCT CAA CCT GTA TCT GGC AAG GAA GCT An GCT CAG GTC 452 

Ala Thr Ala Thr Ala Val Glu Ala Leu Lys Ala He Ala Gin Pro Val Ser Gly Lys Glu Ala He Ala Gin Val 146 

GCT GCA GTA TCA TCA CGC TCT GAA AAA GTT GGA GAG TAT ATC TCA GAA GCT ATG GAG CGT GTG GGC AAC GAT GGT 527 

Ala Ala Val Ser Ser Arg Ser Glu Lys Val Gly Glu Tyr lie Ser Glu Ala Met Glu Arg Val Gly Asn Asp Gly 171 

GTG ATT ACC ATC GAA GAA TCT CGA GGT ATG GAA ACA GAA CTT GAA GTG GTT GAA GGC ATG CAA TTT GAC CGT GGT 602 

Val He Thr He Glu Glu Ser Arg Gly Met Glu Thr Glu Leu Glu Val Val Glu Gly Met Gin Phe Asp Arg Gly 196 

TAC CTG TCT CAA TAC ATG GTC ACA GAC AAT GAA AAA ATG GTT GCA GAC CTT GAA AAC CCA TTT ATC TTA ATC ACG 677 

Tyr Leu Ser Gin Tyr Met Val Thr Asp Asn Glu Lys Met Val Ala Asp Leu Glu Asn Pro Phe He Leu He Thr 221 

GAT AAA AAA GTG TCA AAC ATC CAA GAC An nG CCA CTA CTT GAG GAA Gn CTT AAA ACC AAC CGT CCA TO CTC 752 

Asp Lys Lys Val Ser Asn He Gin Asp He Leu Pro Leu Leu Glu Glu Val Leu Lys Thr Asn Arg Pro Leu Leu 246 

An An GCA GAT GAT GTG GAT GGT GAA GCA CTT CCA ACC OT GTC nG AAC AAG An CGT GGT ACT nC AAT GTG 827 

lie He Ala Asp Asp Val Asp Gly Glu Ala Leu Pro Thr Leu Val Leu Asn Lys He Arg Gly Thr Phe Asn Val 271 

Gn GCT GTC AAA GCG CCA GGA TTT GGT GAT CGT CGT AAA GCT ATG CTT GAA GAC An GCT ATC nG ACA GGT GGT 902 

Val Ala Val Lys Ala Pro Gly Phe Gly Asp Arg Arg Lys Ala Met Leu Glu Asp He Ala He Leu Thr Gly Gly 296 

ACA GTG An ACA GAG GAT CTA GGA CTT GAA TO AAA GAT GCT ACA ATG ACA GCC CTT GGA CAG GCT GCT AAG An 977 

Thr Val He Thr Glu Asp Leu Gly Leu Glu Leu Lys Asp Ala Thr Met Thr Ala Leu Gly Gin Ala Ala Lys He 321 

ACA Gn GAT AAA GAT AGC ACA GTA An Gn GAA GGT TCA GGA ACT TCA GAA GCT An GCT AAC CGT An GCA CTG 1052 

Thr Val Asp Lys Asp Ser Thr Val He Val Glu Gly Ser Gly Ser Ser Glu Ala He Ala Asn Arg He Ala Leu 346 



Fig. 4A 



SUBSTITUTE SHEET (RULE 26) 



WO 99/35270 



PCT/CA98/01203 



8/22 

ATT AAA TCG CAA TTA GAA ACA ACA ACT TCT GAC TFT GAC CGT GAA AAA CTA CAA GAA CGT TTG GCG AAA TTA GCT 1127 

He Lys Ser Gin Leu Glu Thr Thr Thr Ser Asp Phe Asp Arg Glu Lys Leu Gin Glu Arg Leu Ala Lys Leu Ala 371 

GGT GGT GTA GCT GTT ATC AAA GTA GGA GCT CCA ACA GAG ACA GCT TTA AAA GAA ATG AAA CTT CGC ATT GAG GAT 1202 

Gly Gly Val Ala Val He Lys Val Gly Ala Pro Thr Glu Thr Ala Leu Lys Glu Met Lys Leu Arg lie Glu Asp 396 

GCT CTA AAT GCT ACA CGT GCA GCC GTT GAA GAA GGT ATC GTT GCT GGT GGT GGA ACA GCA CTT ATT ACG GTT ATT 1277 

Ala Leu Asn Ala Thr Arg Ala Ala Val Glu Glu Gly He Val Ala Gly Gly Gly Thr Ala Leu He Thr Val He 421 

GAA AAA GTA GCA GCT CTT GAG CTT GAG GGC GAT GAT GCT ACT GGA CGT AAC ATT GTG CTT CGT GCT CTA GAA GAG 1352 

Glu Lys Val Ala Ala Leu Glu Leu Glu Gly Asp Asp Ala Thr Gly Arg Asn He Val Leu Arg Ala Leu Glu Glu 446 

CCT GTA CGT CAA ATT GCT TTA AAT GCT GGG TAC GAA GGC TCC GTA GTT ATT GAC AAG TTG AAA AAC AGC CCT GCA 1427 

Pro Val Arg Gin lie Ala Leu Asn Ala Gly Tyr Glu Gly Ser Val Val He Asp Lys Leu Lys Asn Ser Pro Ala 471 

GGA ACA GGA TTT AAT GCT GCA ACA GGT GAG TGG GTT GAT ATG An AAA ACA GGA ATC ATT GAC CCT GTC AAA GTA 1502 

Gly Thr Gly Phe Asn Ala Ala Thr Gly Glu Trp Val Asp Met He Lys Thr Gly He He Asp Pro Val Lys Val 496 

ACA CGA TCA GCG CTT CAA AAT GCA GCT TCT GTA GCT AGT CTT ATT TTG ACA ACA GAA GCA GTT GTT GCT AAT AAA 1577 

Thr Arg Ser Ala Leu Gin Asn Ala Ala Ser Val Ala Ser Leu He Leu Thr Thr Glu Ala Val Val Ala Asn Lys 521 

CCT GAA CCA GCT ACG CCA GCG CCA GCA ATG CCA GCA GGT ATG GAT CCA GGA ATG ATG GGT GGG ATG GGC GGA TAA 1652 

Pro Glu Pro Ala Thr Pro Ala Pro Ala Met Pro Ala Gly Met Asp Pro Gly Met Met Gly Gly Met Gly Gly ' 546 

GCCGAAT TC 1661 
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Sequencing strategy (scale: 1cm=approx. 1 0Obp) 

W229 W244 W228 W7n7 

M13F W2U _ W200 W227 wTi 8 



P60-1 



W230 W232 ^ W22T M13R 



W220 w22 6 



W224 

W214 W245 . 

M13F ^ -W233 W255^ W215 



P5Q-2 k'vV's^^^'v^^^^^^^v^aVv^^v^v ^'a^ ^ '^ ^^^s^ \s\s\v^ -sssm 

W2T8 " ^W236 _- ^ M13R 



W217 



W234 W219 W237" 
W225 W228 



W195__^-= W227~~ ~W202 
M13F w , 178 W201 W198 

Y60-1 ' "jus*/"* ;;;; ;;;;;;;; / /////// <> <> <> / ///s / / <> / / / / / // <> s / <> t zza 

W196 ' W204 W179 M13R 

^203- W224 W226 W205 W19? 

UMQO W206 W241 W208 . 

M13R wi£ g W238 W207 

Y60-2 ! - • i ■ ;■• ••• - ; • ' ■ ■ ■; ■ ■' 

" "W209 W240 W210 M13F 

W239" W242 W191 
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PCT/CA98/01203 



Nco I 6971 
Xba I 6932 
SgrA I 6824 
Sph I 6673 



Bsp 120 I 
Apa 1 



PshA I 5299 
PpuM I 5037 




58 Ned I 
154 Sfi I 
230 Pmi i 
256 BstB I 
366 Siu I 




base pairs 
Unique Sites 



1542 BamH I 
1706 EcoR I 
1712 Ecl136 I 
1712 Sac I 
1719 Sal I 
1725 Hind III 

1731 Not I 

1732 Eag I 
1740 PaeR7 I 

i 1740 Xho I 

■ 1818 Bpu1102 



Bst1 1 




AlwN I 3629 



2969 Xma I 
3186 Nru I 
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Nco I 6960 
Xba I 6921 
Bgl II 6855 
SgrA I 6813 
Sph I 6662 



58 Nde 



Mlu I 6133 

Bsp120 I 5926 
Apa I 5926 

BssH II 5722 



Bgl I 5070 
Fsp I 5053 
PpuM I 5026 




Mun I 
160 Msc I 
1653 BamH 
1695 
1701 
1701 
1708 
1720 
1721 
i_ 1729 
= 1729 



EcoR 
Ecl136 
Sac I 
Sal I 
Not I 
Eag I 
PaeR7 
Xho I 



I 



2131 Dra 



Bst1 107 I 4263 



2958 Xma 
3175 Nru I 
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Nco I 6968 
Xba I 6929 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: Stessgen Biotechnologies Corporation 

(ii) TITLE OF INVENTION: STREPTOCOCCAL HEAT SHOCK PROTEINS OF THE 
HSP60 FAMILY 

(iii) NUMBER OF SEQUENCES: 91 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Gowling, Strathy & Henderson 

(B) STREET: Commerce Court West, Suite 4900 

(C) CITY: Toronto 

(D) STATE: Ontario 

(E) COUNTRY: Canada 

(F) ZIP: M5L 1J3 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.30 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 2 9 December 1998 

(C) CLASSIFICATION: 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Omar A. Nassif 

(B) REGISTRATION NUMBER: 4 016 

(C) REFERENCE /DOCKET NUMBER : T8464440WO 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (416) 862-7525 

(B) TELEFAX: (416) 862-7661 



(2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1665 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ix) FEATURE: 

(A) NAME/ KEY: CDS 

(B) LOCATION: 15.. 1649 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

GAATTCGGCT TCAT ATG GCG GCT AAA GAC GTA AAA TTC GGT AAC GAC GCT 50 
Met Ala Ala Lys Asp Val Lys Phe Gly Asn Asp Ala 
15 10 

CGT GTG AAA ATG CTG CGC GGC GTA AAC GTA CTG GCA GAT GCA GTG AAA 98 
Arg Val Lys Met Leu Arg Gly Val Asn Val Leu Ala Asp Ala Val Lys 
15 20 25 

GTT ACC CTC GGC CCA AAA GGC CGT AAC GTA GTT CTG GAT AAA TCT TTC 146 
Val Thr Leu Gly Pro Lys Gly Arg Asn Val Val Leu Asp Lys Ser Phe 
30 35 40 

GGT GCA CCG ACC ATC ACT AAA GAT GGT GTT TCC GTA GCA CGT GAA ATC 194 
Gly Ala Pro Thr lie Thr Lys Asp Gly Val Ser Val Ala Arg Glu lie 
45 50 55 60 

GAA CTG GAA GAC AAG TTC GAA AAC ATG GGT GCG CAG ATG GTG AAA GAA 242 
Glu Leu Glu Asp Lys Phe Glu Asn Met Gly Ala Gin Met Val Lys Glu 
65 70 75 

GTT GCC TCT AAA GCG AAC GAC GCT GCA GGT GAC GGT ACC ACC ACC GCA 290 
Val Ala Ser Lys Ala Asn Asp Ala Ala Gly Asp Gly Thr Thr Thr Ala 
80 85 90 

ACC GTA CTG GCT CAG TCC ATC ATC ACT GAA GGC CTG AAA GCC GTT GCT 338 
Thr Val Leu Ala Gin Ser lie lie Thr Glu Gly Leu Lys Ala Val Ala 
95 100 105 

GCG GGC ATG AAC CCG ATG GAT CTG AAA CGT GGT ATC GAC AAA GCT GTC 386 
Ala Gly Met Asn Pro Met Asp Leu Lys Arg Gly lie Asp Lys Ala Val 
110 115 120 

GCT GCT GCT GTT GAA GAA CTG AAA GCA CTG TCC GTA CCG TGC TCC GAC 4 34 

Ala Ala Ala Val Glu Glu Leu Lys Ala Leu Ser Val Pro Cys Ser Asp 
125 130 135 140 

TCT AAA GCT ATT GCT CAG GTT GGT ACC ATC TCC GCT AAC TCC GAC GAA 4 82 

Ser Lys Ala lie Ala Gin Val Gly Thr lie Ser Ala Asn Ser Asp Glu 
145 150 155 

ACC GTA GGT AAA CTG ATC GCT GAA GCG ATG GAC AAA GTC GGT AAA GAA 53 0 

Thr Val Gly Lys Leu lie Ala Glu Ala Met Asp Lys Val Gly Lys Glu 
160 165 170 

GGC GTG ATC ACC GTT GAA GAC GGT ACC GGT CTG CAG GAC GAA CTG GAC 578 
Gly Val l Ile Thr Val Glu Asp Gly Thr Gly Leu Gin Asp Glu Leu Asp 
175 180 185 

GTG GTT GAA GGT ATG CAG TTC GAC CGT GGC TAC CTG TCT CCT TAC TTC 62 6 

Val Val Glu Gly Met Gin Phe Asp Arg Gly Tyr Leu Ser Pro Tyr Phe 
190 195 200 
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ATC AAC AAG CCG GAA ACT GGC GCA GTA GAA TTG GAA AGC CCG TTC ATC 674 
lie Asn Lys Pro Glu Thr Gly Ala Val Glu Leu Glu Ser Pro Phe lie 
205 210 215 220 

CTG CTG GCT GAC AAG AAA ATC TCC AAC ATC CGC GAA ATG CTG CCG GTT 722 
Leu Leu Ala Asp Lys Lys lie Ser Asn lie Arg Glu Met Leu Pro Val 
225 230 235 

CTG GAA GCT GTA GCG AAA GCA GGC AAA CCG CTG CTG ATC ATC GCT GAA 770 
Leu Glu Ala Val Ala Lys Ala Gly Lys Pro Leu Leu lie He Ala Glu 
240 245 250 

GAT GTT GAA GGC GAA GCG CTG GCA ACT CTG GTT GTT AAC ACC ATG CGC 818 
Asp Val Glu Gly Glu Ala Leu Ala Thr Leu Val Val Asn Thr Met Arg 
255 260 265 

GGT ATC GTA AAA GTC GCT GCG GTT AAA GCA CCT GGC TTC GGC GAT CGT 866 
Gly He Val Lys Val Ala Ala Val Lys Ala Pro Gly Phe Gly Asp Arg 
270 *. 275 280 

CGT AAA GCA ATG CTG CAG GAT ATC GCT ACC CTG ACC GGT GGT ACC GTT 914 
Arg Lys Ala Met Leu Gin Asp He Ala Thr Leu Thr Gly Gly Thr Val 
285 290 295 300 

ATC TCT GAA GAG ATC GGT ATG GAG CTG GAA AAA GCA ACT CTG GAA GAT 962 
He Ser Glu Glu lie Gly Met Glu Leu Glu Lys Ala Thr Leu Glu Asp 
305 310 315 

CTG GGC CAG GCG AAA CGC GTT GTT ATC AAC AAA GAT ACC ACC ACC ATC 1010 
Leu Gly Gin Ala Lys Arg Val Val He Asn Lys Asp Thr Thr Thr He 
320 325 330 

ATC GAT GGC GTG GGC GAC GAA GCT GCA ATC CAG GGT CGC GTG ACT CAG 1058 
He Asp Gly Val Gly Asp Glu Ala Ala He Gin Gly Arg Val Thr Gin 
335 340 345 

ATT CGT CAG CAG ATC GAA GAA GCA ACT TCC GAC TAT GAC CGT GAA AAA 1106 
He Arg Gin Gin He Glu Glu Ala Thr Ser Asp Tyr Asp Arg Glu Lys 
350 355 360 

CTG CAG GAG CGC GTA GCG AAA CTG GCA GGC GGC GTT GCG GTT ATC AAA 1154 
Leu Gin Glu Arg Val Ala Lys Leu Ala Gly Gly Val Ala Val He Lys 
365 370 375 380 

GTT GGT GCT GCG ACT GAA GTT GAA ATG AAA GAG AAG AAA GCC CGC GTT 1202 
Val Gly Ala Ala Thr Glu Val Glu Met Lys Glu Lys Lys Ala Arg Val 
385 390 395 

GAA GAT GCC CTG CAC GCT ACC CGT GCT GCG GTA GAA GAA GGC GTG GTT 125 0 

Glu Asp Ala Leu His Ala Thr Arg Ala Ala Val Glu Glu Gly Val Val 
400 405 410 

GCT GGT GGT GGC GTT GCG CTG ATT CGC GTA GCG TCT AAA ATT GCC GGC 12 98 

Ala Gly Gly Gly Val Ala Leu He Arg Val Ala Ser Lys He Ala Gly 
415 420 425 
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CTG AAA GGT CAG AAC GAA GAC CAG AAC GTA GGT ATC AAA GTT GCG CTG 134 6 

Leu Lys Gly Gin Asn Glu Asp Gin Asn Val Gly lie Lys Val Ala Leu 
430 435 440 

CGC GCA ATG GAA TCC CCA CTG CGT CAA ATC GTA CTG AAC TGC GGC GAA 13 94 

Arg Ala Met Glu Ser Pro Leu Arg Gin lie Val Leu Asn Cys Gly Glu 
445 450 455 460 

GAG CCG TCT GTA GTG GCT AAC ACC GTG AAA GCC GGT GAC GGT AAC TAC 1442 
Glu Pro Ser Val Val Ala Asn Thr Val Lys Ala Gly Asp Gly Asn Tyr 
465 470 475 

GGT TAC AAC GCT GCA ACT GAA GAA TAC GGC AAC ATG ATC GAC ATG GGT 14 9 0 

Gly Tyr Asn Ala Ala Thr Glu Glu Tyr Gly Asn Met lie Asp Met Gly 
480 485 490 

ATC CTG GAT CCA ACC AAA GTA ACT CGT TCT GCT CTG CAG TAC GCG GCT 1538 
lie Leu Asp Pro Thr Lys Val Thr Arg Ser Ala Leu Gin Tyr Ala Ala 
495 , 500 505 

TCT GTT GCG GGT CTG ATG ATC ACC ACC GAG TGC ATG GTT ACC GAC CTG 1586 
Ser Val Ala Gly Leu Met lie Thr Thr Glu Cys Met Val Thr Asp Leu 
510 515 520 

CCG AAA GGC GAT GCA CCT GAC TTA GGT GCT GCT GGT GGT ATG GGC GGC 1634 
Pro Lys Gly Asp Ala Pro Asp Leu Gly Ala Ala Gly Gly Met Gly Gly 
525 530 535 540 

ATG GGC GGA ATG ATG TGATCAAGCC GAATTC 166 5 

Met Gly Gly Met Met 
545 



(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 54 5 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2: 

Met Ala Ala Lys Asp Val Lys Phe Gly Asn Asp Ala Arg Val Lys Met 
15 10 15 

Leu Arg Gly Val Asn Val Leu Ala Asp Ala Val Lys Val Thr Leu Gly 
20 25 30 

Pro Lys Gly Arg Asn Val Val Leu Asp Lys Ser Phe Gly Ala Pro Thr 
35 40 45 

lie Thr Lys Asp Gly Val Ser Val Ala Arg Glu He Glu Leu Glu Asp 
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50 55 60 

Lys Phe Glu Asn Met Gly Ala Gin Met Val Lys Glu Val Ala Ser Lys 
65 70 75 80 

Ala Asn Asp Ala Ala Gly Asp Gly Thr Thr Thr Ala Thr Val Leu Ala 
85 90 95 

Gin Ser lie lie Thr Glu Gly Leu Lys Ala Val Ala Ala Gly Met Asn 
100 105 110 

Pro Met Asp Leu Lys Arg Gly lie Asp Lys Ala Val Ala Ala Ala Val 
115 120 125 

Glu Glu Leu Lys Ala Leu Ser Val Pro Cys Ser Asp Ser Lys Ala lie 
130 135 140 

Ala Gin Val Gly Thr lie Ser Ala Asn Ser Asp Glu Thr Val Gly Lys 
145 150 155 160 

Leu lie Ala Glu Ala Met Asp Lys Val Gly Lys Glu Gly Val lie Thr 
165 170 175 

Val Glu Asp Gly Thr Gly Leu Gin Asp Glu Leu Asp Val Val Glu Gly 
180 185 190 

Met Gin Phe Asp Arg Gly Tyr Leu Ser Pro Tyr Phe lie Asn Lys Pro 
195 200 205 

Glu Thr Gly Ala Val Glu Leu Glu Ser Pro Phe lie Leu Leu Ala Asp 
210 215 220 

Lys Lys lie Ser Asn He Arg Glu Met Leu Pro Val Leu Glu Ala Val 
225 230 235 240 

Ala Lys Ala Gly Lys Pro Leu Leu He He Ala Glu Asp Val Glu Gly 
245 250 255 

Glu Ala Leu Ala Thr Leu Val Val Asn Thr Met Arg Gly He Val Lys 
260 265 270 

Val Ala Ala Val Lys Ala Pro Gly Phe Gly Asp Arg Arg Lys Ala Met 
275 280 285 

Leu Gin Asp He Ala Thr Leu Thr Gly Gly Thr Val He Ser Glu Glu 
290 295 300 

He Gly Met Glu Leu Glu Lys Ala Thr Leu Glu Asp Leu Gly Gin Ala 
305 310 315 320 

Lys Arg Val Val He Asn Lys Asp Thr Thr Thr He He Asp Gly Val 
325 330 335 



Gly Asp Glu Ala Ala He Gin Gly Arg Val Thr Gin lie Arg Gin Gin 
340 345 350 
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lie Glu Glu Ala Thr Ser Asp Tyr Asp Arg Glu Lys Leu Gin Glu Arg 
355 360 365 

Val Ala Lys Leu Ala Gly Gly Val Ala Val lie Lys Val Gly Ala Ala 
370 375 380 

Thr Glu Val Glu Met Lys Glu Lys Lys Ala Arg Val Glu Asp Ala Leu 
385 390 395 400 

His Ala Thr Arg Ala Ala Val Glu Glu Gly Val Val Ala Gly Gly Gly 
405 410 415 

Val Ala Leu lie Arg Val Ala Ser Lys lie Ala Gly Leu Lys Gly Gin 
420 425 430 

Asn Glu Asp Gin Asn Val Gly lie Lys Val Ala Leu Arg Ala Met Glu 
435 440 445 

Ser Pro Leu Arg Gin He Val Leu Asn Cys Gly Glu Glu Pro Ser Val 
450 455 460 

Val Ala Asn Thr Val Lys Ala Gly Asp Gly Asn Tyr Gly Tyr Asn Ala 
465 470 475 480 

Ala Thr Glu Glu Tyr Gly Asn Met He Asp Met Gly He Leu Asp Pro 
485 490 495 

Thr Lys Val Thr Arg Ser Ala Leu Gin Tyr Ala Ala Ser Val Ala Gly 
500 505 510 

Leu Met He Thr Thr Glu Cys Met Val Thr Asp Leu Pro Lys Gly Asp 
515 520 525 

Ala Pro Asp Leu Gly Ala Ala Gly Gly Met Gly Gly Met Gly Gly Met 
530 535 540 

Met 
545 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1654 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 15.. 1637 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 

GAATTCGGCT TCAT ATG GCA AAA GAA ATT AAA TTT TCA TCA GAT GCC CGT 50 
Met Ala Lys Glu lie Lys Phe Ser Ser Asp Ala Arg 
15 10 

TCA GCT ATG GTC CGT GGT GTC GAT ATC CTT GCA GAT ACT GTT AAA GTA 98 
Ser Ala Met Val Arg Gly Val Asp lie Leu Ala Asp Thr Val Lys Val 
15 20 25 

ACT TTG GGA CCA AAA GGT CGC AAT GTC GTT CTT GAA AAG TCA TTC GGT 14 6 

Thr Leu Gly Pro Lys Gly Arg Asn Val Val Leu Glu Lys Ser Phe Gly 
30 35 40 

TCA CCC TTG ATT ACC AAT GAC GGT GTG ACT ATT GCC AAA GAA ATT GAA 194 
Ser Pro Leu lie Thr Asn Asp Gly Val Thr lie Ala Lys Glu lie Glu 
45 50 55 60 

TTA GAA GAC CAT TTT GAA AAT ATG GGT GCC AAA TTG GTA TCA GAA GTA 242 
Leu Glu Asp His Phe Glu Asn Met Gly Ala Lys Leu Val Ser Glu Val . 

65 70 75 

GCT TCA AAA ACC AAT GAT ATC GCA GGT GAT GGA ACT ACA ACT GCA ACT 290 
Ala Ser Lys Thr Asn Asp lie Ala Gly Asp Gly Thr Thr Thr Ala Thr 
80 85 90 

GTT TTG ACC CAA GCA ATC GTC CGT GAA GGA ATC AAA AAC GTC ACA GCA 338 
Val Leu Thr Gin Ala lie Val Arg Glu Gly lie Lys Asn Val Thr Ala 
95 100 105 

GGT GCA AAT CCA ATC GGT ATT CGT CGT GGG ATT GAA ACA GCA GTT GCC 386 
Gly Ala Asn Pro lie Gly lie Arg Arg Gly lie Glu Thr Ala Val Ala 
110 115 120 

GCA GCA GTT GAA GCT TTG AAA AAC AAC GTC ATC CCT GTT GCC AAT AAA 434 
Ala Ala Val Glu Ala Leu Lys Asn Asn Val lie Pro Val Ala Asn Lys 
125 130 135 140 

GAA GCT ATC GCT CAA GTT GCA GCC GTA TCT TCT CGT TCT GAA AAA GTT 482 
Glu Ala lie Ala Gin Val Ala Ala Val Ser Ser Arg Ser Glu Lys Val 
145 150 155 

GGT GAG TAC ATC TCT GAA GCA ATG GAA AAA GTT GGC AAA GAC GGT GTC 53 0 

Gly Glu Tyr lie Ser Glu Ala Met Glu Lys Val Gly Lys Asp Gly Val 
160 165 170 

ATC ACC ATC GAA GAG TCA CGT GGT ATG GAA ACA GAG CTT GAA GTC GTA 578 
lie Thr He Glu Glu Ser Arg Gly Met Glu Thr Glu Leu Glu Val Val 
175 180 185 

GAA GGA ATG CAG TTT GAC CGT GGT TAC CTT TCA CAG TAC ATG GTG ACA 626 
Glu Gly Met Gin Phe Asp Arg Gly Tyr Leu Ser Gin Tyr Met Val Thr 
190 195 200 

GAT AGC GAA AAA ATG GTG GCT GAC CTT GAA AAT CCG TAC ATT TTG ATT 674 
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Asp Ser Glu Lys Met Val Ala Asp Leu Glu Asn Pro Tyr lie Leu lie 
205 210 215 220 

ACA GAC AAG AAA ATT TCC AAT ATC CAA GAA ATC TTG CCA CTT TTG GAA 722 
Thr Asp Lys Lys He Ser Asn He Gin Glu He Leu Pro Leu Leu Glu 
225 230 235 

AGC ATT CTC CAA AGC AAT CGT CCA CTC TTG ATT ATT GCG GAT GAT GTG 770 
Ser He Leu Gin Ser Asn Arg Pro Leu Leu He He Ala Asp Asp Val 
240 245 250 

GAT GGT GAG GCT CTT CCA ACT CTT GTT TTG AAC AAG ATT CGT GGA ACC 818 
Asp Gly Glu Ala Leu Pro Thr Leu Val Leu Asn Lys He Arg Gly Thr 
255 260 265 

TTC AAC GTA GTA GCA GTC AAG GCA CCT GGT TTT GGT GAC CGT CGC AAA 86 6 

Phe Asn Val Val Ala Val Lys Ala Pro Gly Phe Gly Asp Arg Arg Lys 
270 275 280 

GCC ATG CTT GAA GAT ATC GCC ATC TTA ACA GGC GGA ACA GTT ATC ACA 914 
Ala Met Leu Glu Asp He Ala He Leu Thr Gly Gly Thr Val He Thr 
285 290 295 300 

GAA GAC CTT GGT CTT GAG TTG AAA GAT GCG ACA ATT GAA GCT CTT GGT 962 
Glu Asp Leu Gly Leu Glu Leu Lys Asp Ala Thr He Glu Ala Leu Gly 
305 310 315 

CAA GCA GCG AGA GTG ACC GTG GAC AAA GAT AGC ACG GTT ATT GTA GAA 1010 
Gin Ala Ala Arg Val Thr Val Asp Lys Asp Ser Thr Val lie Val Glu 
320 325 330 

GGT GCA GGA AAT CCT GAA GCG ATT TCT CAC CGT GTT GCG GTT ATC AAG 1058 
Gly Ala Gly Asn Pro Glu Ala He Ser His Arg Val Ala Val lie Lys 
335 340 345 

TCT CAA ATC GAA ACT ACA ACT TCT GAA TTT GAC CGT GAA AAA TTG CAA 1106 
Ser Gin He Glu Thr Thr Thr Ser Glu Phe Asp Arg Glu Lys Leu Gin 
350 355 360 

GAA CGC TTG GCC AAA TTG TCA GGT GGT GTA GCG GTT ATT AAG GTC GGA 1154 
Glu Arg Leu Ala Lys Leu Ser Gly Gly Val Ala Val He Lys Val Gly 
365 370 375 380 

GCC GCA ACT GAA ACT GAG TTG AAA GAA ATG AAA CTC CGC ATT GAA GAT 1202 
Ala Ala Thr Glu Thr Glu Leu Lys Glu Met Lys Leu Arg He Glu Asp 
385 390 395 

GCC CTC AAC GCT ACT CGT GCA GCT GTT GAA GAA GGT ATT GTT GCA GGT 1250 
Ala Leu Asn Ala Thr Arg Ala Ala Val Glu Glu Gly He Val Ala Gly 
400 405 410 

GGT GGA ACA GCT CTT GCC AAT GTG ATT CCA GCT GTT GCT ACC TTG GAA 12 98 

Gly Gly Thr Ala Leu Ala Asn Val He Pro Ala Val Ala Thr Leu Glu 
415 420 425 
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TTG ACA GGA GAT GAA GCA ACA GGA CGT AAT ATT GTT CTC CGT GCT TTG 134 6 

Leu Thr Gly Asp Glu Ala Thr Gly Arg Asn lie Val Leu Arg Ala Leu 
430 435 440 

GAA GAA CCT GTT CGT CAA ATT GCT CAC AAT GCA GGA TTT GAA GGA TCT 1394 
Glu Glu Pro Val Arg Gin lie Ala His Asn Ala Gly Phe Glu Gly Ser 
445 450 455 460 

ATC GTT ATC GAT CGT TTG AAA AAT GCT GAG CTT GGT ATA GGA TTC AAC 1442 
lie Val lie Asp Arg Leu Lys Asn Ala Glu Leu Gly lie Gly Phe Asn 
465 470 475 

GCA GCA ACT GGC GAG TGG GTT AAC ATG ATT GAT CAA GGT ATC ATT GAT 14 9 0 

Ala Ala Thr Gly Glu Trp Val Asn Met lie Asp Gin Gly lie lie Asp 
480 485 490 

CCA GTT AAA GTG AGT CGT TCA GCC CTA CAA AAT GCA GCA TCT GTA GCC 1538 
Pro Val Lys Val Ser Arg Ser Ala Leu Gin Asn Ala Ala Ser Val Ala 
495 500 505 

AGC TTG ATT TTG ACA ACA GAA GCA GTC GTA GCC AAT AAA CCA GAA CCA 1586 
Ser Leu lie Leu Thr Thr Glu Ala Val Val Ala Asn Lys Pro Glu Pro 
510 515 520 

GTA GCC CCA GCT CCA GCA ATG GAT CCA AGT ATG ATG GGT GGA ATG GGC 1634 
Val Ala Pro Ala Pro Ala Met Asp Pro Ser Met Met Gly Gly Met Gly 
525 530 535 540 

GGA TGATCAAAGC CGAATTC 1654 
Gly 



(2) INFORMATION FOR SEQ ID NO : 4 : 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 541 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4: 

Met Ala Lys Glu lie Lys Phe Ser Ser Asp Ala Arg Ser Ala Met Val 
15 10 15 

Arg Gly Val Asp lie Leu Ala Asp Thr Val Lys Val Thr Leu Gly Pro 
20 25 30 • 

Lys Gly Arg Asn Val Val Leu Glu Lys Ser Phe Gly Ser Pro Leu lie 
35 40 45 



Thr Asn Asp Gly Val Thr lie Ala Lys Glu He Glu Leu Glu Asp His 
50 55 60 



WO 99/35270 PCT/CA98/01203 

10 



Phe Glu Asn Met Gly Ala Lys Leu Val Ser Glu Val Ala Ser Lys Thr 
65 70 75 80 

Asn Asp lie Ala Gly Asp Gly Thr Thr Thr Ala Thr Val Leu Thr Gin 
85 90 95 

Ala lie Val Arg Glu Gly lie Lys Asn Val Thr Ala Gly Ala Asn Pro 
100 105 110 

He Gly He Arg Arg Gly He Glu Thr Ala Val Ala Ala Ala Val Glu 
115 120 125 

Ala Leu Lys Asn Asn Val He Pro Val Ala Asn Lys Glu Ala He Ala 
130 135 140 

Gin Val Ala Ala Val Ser Ser Arg Ser Glu Lys Val Gly Glu Tyr He 
145 150 155 160 

Ser Glu Ala Met Glu Lys Val Gly Lys Asp Gly Val He Thr He Glu 
165 170 175 

Glu Ser Arg Gly Met Glu Thr Glu Leu Glu Val Val Glu Gly Met Gin 
180 185 190 

Phe Asp Arg Gly Tyr Leu Ser Gin Tyr Met Val Thr Asp Ser Glu Lys 
195 200 205 

Met Val Ala Asp Leu Glu Asn Pro Tyr He Leu He Thr Asp Lys Lys 
210 215 220 

He Ser Asn He Gin Glu He Leu Pro Leu Leu Glu Ser lie Leu Gin 
225 230 235 240 

Ser Asn Arg Pro Leu Leu He He Ala Asp Asp Val Asp Gly Glu Ala 
245 250 255 

Leu Pro Thr Leu Val Leu Asn Lys He Arg Gly Thr Phe Asn Val Val 
260 265 270 

Ala Val Lys Ala Pro Gly Phe Gly Asp Arg Arg Lys Ala Met Leu Glu 
275 280 285 

Asp He Ala He Leu Thr Gly Gly Thr Val He Thr Glu Asp Leu Gly 
290 295 300 

Leu Glu Leu Lys Asp Ala Thr He Glu Ala Leu Gly Gin Ala Ala Arg 
305 310 315 320 

Val Thr Val Asp Lys Asp Ser Thr Val He Val Glu Gly Ala Gly Asn 
325 330 335 



Pro Glu Ala He Ser His Arg Val Ala Val He Lys Ser Gin He Glu 
340 345 350 
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Thr Thr Thr Ser Glu Phe Asp Arg Glu Lys Leu Gin Glu Arg Leu Ala 
355 360 365 

Lys Leu Ser Gly Gly Val Ala Val He Lys Val Gly Ala Ala Thr Glu 
370 375 380 

Thr Glu Leu Lys Glu Met Lys Leu Arg He Glu Asp Ala Leu Asn Ala 
385 390 395 400 

Thr Arg Ala Ala Val Glu Glu Gly He Val Ala Gly Gly Gly Thr Ala 
405 410 415 

Leu Ala Asn Val He Pro Ala Val Ala Thr Leu Glu Leu Thr Gly Asp 
420 425 430 

Glu Ala Thr Gly Arg Asn He Val Leu Arg Ala Leu Glu Glu Pro Val 
435 440 445 

Arg Gin He Ala His Asn Ala Gly Phe Glu Gly Ser He Val He Asp 
450 455 460 

Arg Leu Lys Asn Ala Glu Leu Gly He Gly Phe Asn Ala Ala Thr Gly 
465 470 475 480 

Glu Trp Val Asn Met He Asp Gin Gly lie lie Asp Pro Val Lys Val 
485 490 495 

Ser Arg Ser Ala Leu Gin Asn Ala Ala Ser Val Ala Ser Leu He Leu 
500 505 510 

Thr Thr Glu Ala Val Val Ala Asn Lys Pro Glu Pro Val Ala Pro Ala 
515 520 525 

Pro Ala Met Asp Pro Ser Met Met Gly Gly Met Gly Gly 
530 535 540 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1662 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ix) FEATURE : 

(A) NAME /KEY : CDS 

(B) LOCATION: 15. .1646 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 



GAATTCGGCT TCAT ATG GCG GCT AAA GAT GTA AAA TTC GGT AAC GAC GCT 
Met Ala Ala Lys Asp Val Lys Phe Gly Asn Asp Ala 



50 



WO 99/35270 



PCT/CA98/01203 



12 



15 10 

CGT GTA AAA ATG CTC CGC GGC GTA AAC GTA CTG GCA GAC GCA GTT AAA 98 
Arg Val Lys Met Leu Arg Gly Val Asn Val Leu Ala Asp Ala Val Lys 
15 20 25 

GTA ACC CTG GGC CCG AAA GGC CGT AAC GTA GTG CTG GAC AAA TCC TTC 14 6 

Val Thr Leu Gly Pro Lys Gly Arg Asn Val Val Leu Asp Lys Ser Phe 
30 35 40 

GGC GCG CCA ACC ATC ACG AAA GAT GGT GTT TCT GTA GCA CGT GAA ATC 194 
Gly Ala Pro Thr lie Thr Lys Asp Gly Val Ser Val Ala Arg Glu lie 
45 50 55 60 

GAG CTG GAA GAC AAG TTC GAA AAC ATG GGC GCG CAG ATG GTG AAA GAA 242 
Glu Leu Glu Asp Lys Phe Glu Asn Met Gly Ala Gin Met Val Lys Glu 
65 70 75 

GTG GCC TCT AAA GCG AAC GAC GCT GCA GGC GAC GGT ACC ACC ACC GCG 290 
Val Ala Ser Lys Ala Asn Asp Ala Ala Gly Asp Gly Thr Thr Thr Ala 
80 85 90 

ACC GTG CTG GCT CAG GCT ATC ATC ACC GAA GGT CTG AAA GCC GTT GCT 338 
Thr Val Leu Ala Gin Ala lie lie Thr Glu Gly Leu Lys Ala Val Ala 
95 100 105 

GCG GGC ATG AAC CCA ATG GAT CTG AAA CGT GGT ATC GAC AAA GCT GTC 386 
Ala Gly Met Asn Pro Met Asp Leu Lys Arg Gly lie Asp Lys Ala Val 
110 115 120 

GCG TCC GCT GTT GAA GAA CTG AAA GCG CTG TCC GTA CCG TGC TCT GAC 434 
Ala Ser Ala Val Glu Glu Leu Lys Ala Leu Ser Val Pro Cys Ser Asp 
125 130 135 . 140 

TCT AAA GCC ATT GCT CAG GTA GGT ACC ATC TCC GCT AAC TCC GAC GAA 4 82 

Ser Lys Ala lie Ala Gin Val Gly Thr lie Ser Ala Asn Ser Asp Glu 
145 150 155 

ACC GTA GGT AAA CTG ATC GCG GAA GCG ATG GAT AAA GTC GGT AAA GAA 53 0 

Thr Val Gly Lys Leu lie Ala Glu Ala Met Asp Lys Val Gly Lys Glu 
160 165 170 

GGC GTG ATC ACC GTT GAA GAC GGT ACC GGT CTG GAA GAC GAA CTG GAC 578 
Gly Val lie Thr Val Glu Asp Gly Thr Gly Leu Glu Asp Glu Leu Asp 
175 180 185 

GTG GTT GAA GGT ATG CAG TTC GAC CGC GGT TAC CTG TCC CCA TAC TTC 626 
Val Val Glu Gly Met Gin Phe Asp Arg Gly Tyr Leu Ser Pro Tyr Phe 
190 195 200 

ATC AAC AAG CCA GAA ACT GGC GCT GTT GAG CTG GAA AGC CCG TTC ATC 6 74 

lie Asn Lys Pro Glu Thr Gly Ala Val Glu Leu Glu Ser Pro Phe He 
205 210 215 220 

CTG CTG GCT GAC AAG AAA ATC TCC AAC ATC CGC GAA ATG CTG CCA GTG 722 
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Leu Leu Ala Asp Lys Lys lie Ser Asn lie Arg Glu Met Leu Pro Val 
225 230 235 

CTG GAA GCC GTT GCG AAA GCA GGC AAA CCG CTG GTT ATC ATT GCT GAA 770 
Leu Glu Ala Val Ala Lys Ala Gly Lys Pro Leu Val lie lie Ala Glu 
240 245 250 

GAC GTT GAA GGC GAA GCG CTG GCG ACC CTG GTG GTT AAC ACC ATG CGT 818 
Asp Val Glu Gly Glu Ala Leu Ala Thr Leu Val Val Asn Thr Met Arg 
255 260 265 

GGC ATC GTG AAA GTG GCT GCG GTT AAA GCA CCT GGC TTC GGC GAC CGC 866 
Gly /lie Val Lys Val Ala Ala Val Lys Ala Pro Gly Phe Gly Asp Arg 
270 275 280 

CGT AAA GCG ATG CTG CAG GAT ATC GCT ACC CTG ACC GGC GGT ACC GTC 914 
Arg Lys Ala Met Leu Gin Asp lie Ala Thr Leu Thr Gly Gly Thr Val 
285 290 295 300 

ATC TCT GAA GAG ATC GGT ATG GAG CTG GAA AAA GCG ACC CTG GAA' GAC 962 
lie Ser Glu Glu lie Gly Met Glu Leu Glu Lys Ala Thr Leu Glu Asp 
305 310 315 

CTG GGC CAG GCT AAA CGT GTT GTG ATC AAC AAA GAC ACC ACC ACC ATC 1010 
Leu Gly Gin Ala Lys Arg Val Val lie Asn Lys Asp Thr Thr Thr lie 
320 325 330 

ATC GAT GGC GTG GGC GAC GAA GCG GCG ATT CAG GGC CGT GTT GGT CAG 1058 
lie Asp Gly Val Gly Asp Glu Ala Ala lie Gin Gly Arg Val Gly Gin 
335 340 345 

ATC CGT AAG CAG ATC GAA GAA GCC ACT TCC GAT TAC GAC CGT GAA AAA 1106 
lie Arg Lys Gin lie Glu Glu Ala Thr Ser Asp Tyr Asp Arg Glu Lys 
350 355 360 

CTG CAG GAG CGC GTA GCG AAA CTG GCA GGC GGT GTT GCG GTA ATC AAA 1154 
Leu Gin Glu Arg Val Ala Lys Leu Ala Gly Gly Val Ala Val lie Lys 
365 370 375 380 

GTC GGT GCT GCG ACT GAA GTT GAA ATG AAA GAG AAA AAA GCA CGC GTT 1202 
Val Gly Ala Ala Thr Glu Val Glu Met Lys Glu Lys Lys Ala Arg Val 
385 390 395 

GAC GAT GCC CTG CAC GCG ACC CGT GCT GCG GTA GAA GAA GGC GTG GTT 1250 
Asp Asp Ala Leu His Ala Thr Arg Ala Ala Val Glu Glu Gly Val Val 
400 405 410 

GCT GGT GGT GGT GTG GCG CTG GTG CGT GTT GCC GCG AAA CTG TCC GGC 12 98 

Ala Gly Gly Gly Val Ala Leu Val Arg Val Ala Ala Lys Leu Ser Gly 
415 420 425 

CTG ACT GCT CAG AAC GAA GAT CAG AAC GTG GGT ATC AAA GTT GCG CTG 134 6 

Leu Thr Ala Gin Asn Glu Asp Gin Asn Val Gly lie Lys Val Ala Leu 
430 435 440 
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CGC GCA ATG GAA GCT CCA CTG CGT CAG ATC GTG TCC AAC GCC GGT GAA 13 94 

Arg Ala Met Glu Ala Pro Leu Arg Gin lie Val Ser Asn Ala Gly Glu 
445 450 455 460 

GAG CCA TCT GTT GTG ACC AAC AAC GTG AAA GCA GGC GAA GGT AAC TAC 1442 
Glu Pro Ser Val Val Thr Asri Asn Val Lys Ala Gly Glu Gly Asn Tyr 
465 470 475 

GGT TAC AAC GCA GCA ACT GAA GAA TAC GGC AAC ATG ATC GAC TTC GGT 14 90 

Gly Tyr Asn Ala Ala Thr Glu Glu Tyr Gly Asn Met lie Asp Phe Gly 
480 485 490 

ATC CTG GAT CCA ACC AAA GTG ACC CGT TCT GCT CTG CAG TAC GCG GCA 1538 
lie Leu Asp Pro Thr Lys Val Thr Arg Ser Ala Leu Gin Tyr Ala Ala 
495 500 505 

TCT GTC GCT GGC CTG ATG ATC ACC ACC GAG TGC ATG GTG ACC GAC CTG 1586 
Ser Val Ala Gly Leu Met lie Thr Thr Glu Cys Met Val Thr Asp Leu 
510 515 520 

CCT AAA GGC GAC GCA CCT GAC TTA GGT GCT GCA GGC ATG GGT GGG ATG 1634 
Pro Lys Gly Asp Ala Pro Asp Leu Gly Ala Ala Gly Met Gly Gly Met 
525 530 535 540 

GGC GGT ATG ATG TGATCAAGCC GAATTC 1662 
Gly Gly Met Met 



(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 544 amino acids 
<B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Met Ala Ala Lys Asp Val Lys Phe Gly Asn Asp Ala Arg Val Lys Met 
15 10 15 

Leu Arg Gly Val Asn Val Leu Ala Asp Ala Val Lys Val Thr Leu Gly 
20 25 30 

Pro Lys Gly Arg Asn Val Val Leu Asp Lys Ser Phe Gly Ala Pro Thr 
35 40 45 

lie Thr Lys Asp Gly Val Ser Val Ala Arg Glu lie Glu Leu Glu Asp 
50 55 60 

Lys Phe Glu Asn Met Gly Ala Gin Met Val Lys Glu Val Ala Ser Lys 
65 70 75 80 
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Ala Asn Asp Ala Ala Gly Asp Gly Thr Thr Thr Ala Thr Val Leu Ala 
. 85 90 95 

Gin Ala He He Thr Glu Gly Leu Lys Ala Val Ala Ala Gly Met Asn 
100 105 110 

Pro Met Asp Leu Lys Arg Gly He Asp Lys Ala Val Ala Ser Ala Val 
115 120 125 

Glu Glu Leu Lys Ala Leu Ser Val Pro Cys Ser Asp Ser Lys Ala lie 
130 135 140 

Ala Gin Val Gly Thr He Ser Ala Asn Ser Asp Glu Thr Val Gly Lys 
145 . • 150 155 160 

Leu He Ala Glu Ala Met Asp Lys Val Gly Lys Glu Gly Val He Thr 
165 170 175 

Val Glu Asp Gly Thr Gly Leu Glu Asp Glu Leu Asp Val Val Glu Gly 
180 185 190 

Met Gin Phe Asp Arg Gly Tyr Leu Ser Pro Tyr Phe He Asn Lys Pro 
195 200 205 

Glu Thr Gly Ala Val Glu Leu Glu Ser Pro Phe He Leu Leu Ala Asp 
210 215 220 

Lys Lys He Ser Asn He Arg Glu Met Leu Pro Val Leu Glu Ala Val 
225 230 235 240 

Ala Lys Ala Gly Lys Pro Leu Val He lie Ala Glu Asp Val Glu Gly 
245 250 255 

Glu Ala Leu Ala Thr Leu Val Val Asn Thr Met Arg Gly He Val Lys 
260 265 270 

Val Ala Ala Val Lys Ala Pro Gly Phe Gly Asp Arg Arg Lys Ala Met 
275 280 285 

Leu Gin Asp He Ala Thr Leu Thr Gly Gly Thr Val He Ser Glu Glu 
290 295 300 

He Gly Met Glu Leu Glu Lys Ala Thr Leu Glu Asp Leu Gly Gin Ala 
305 310 315 320 

Lys Arg Val Val He Asn Lys Asp Thr Thr Thr He He Asp Gly Val 
325 330 335 

Gly Asp Glu Ala Ala He Gin Gly Arg Val Gly Gin He Arg Lys Gin 
340 345 350 

He Glu Glu Ala Thr Ser Asp Tyr Asp Arg Glu Lys Leu Gin Glu Arg 
355 360 365 



Val Ala Lys Leu Ala Gly Gly Val Ala Val He Lys Val Gly Ala Ala 
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370 375 380 

Thr Glu Val Glu Met Lys Glu Lys Lys Ala Arg Val Asp Asp Ala Leu 
385 390 395 400 

His Ala Thr Arg Ala Ala Val Glu Glu Gly Val Val Ala Gly Gly Gly 
405 410 415 

Val Ala Leu Val Arg Val Ala Ala Lys Leu Ser Gly Leu Thr Ala Gin 
420 425 430 

Asn Glu Asp Gin Asn Val Gly lie Lys Val Ala Leu Arg Ala Met Glu 
435 440 445 

Ala Pro Leu Arg Gin lie Val Ser Asn Ala Gly Glu Glu Pro Ser Val 
450 455 460 

Val Thr Asn Asn Val Lys Ala Gly Glu Gly Asn Tyr Gly Tyr Asn Ala 
465 470 475 480 

Ala Thr Glu Glu Tyr Gly Asn Met lie Asp Phe Gly lie Leu Asp Pro 
485 490 495 

Thr Lys Val Thr Arg Ser Ala Leu Gin Tyr Ala Ala Ser Val Ala Gly 
500 505 510 

Leu Met lie Thr Thr Glu Cys Met Val Thr Asp Leu Pro Lys Gly Asp 
515 520 525 

Ala Pro Asp Leu Gly Ala Ala Gly Met Gly Gly Met Gly Gly Met Met 
530 535 540 

{2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1661 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 15.. 1649 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

GAATTCGGCT TCAT ATG GCA AAA GAA ATC AAA TTT TCA GCA GAT GCG CGT 50 
Met Ala Lys Glu lie Lys Phe Ser Ala Asp Ala Arg 
15 10 



GCT GCC ATG GTG CGC GGA GTT GAT ATG TTA GCA GAT ACC GTC AAA GTA 
Ala Ala Met Val Arg Gly Val Asp Met Leu Ala Asp Thr Val Lys Val 



98 
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ACG CTT GGT CCT AAA GGG CGC AAT GTT GTT CTT GAA AAA GCT TTT GGT 14 6 

Thr Leu Gly Pro Lys Gly Arg Asn Val Val Leu Glu Lys Ala Phe Gly 
30 35 40 

TCT CCC TTA ATT ACT AAT GAC GGG GTA ACC ATT GCT AAA GAG ATC GAA 194 
Ser Pro Leu lie Thr Asn Asp Gly Val Thr He Ala Lys Glu He Glu 
45 50 55 60 

TTA GAA GAT CAT TTT GAA AAC ATG GGA GCA AAA TTG GTG TCT GAA GTG 24 2 

Leu Glu Asp His Phe Glu Asn Met Gly Ala Lys Leu Val Ser Glu Val 
65 70 75 

GCT TCT AAA ACC AAT GAT ATT GCT GGT GAT GGG ACG ACT ACT GCA ACA 290 
Ala Ser Lys Thr Asn Asp He Ala Gly Asp Gly Thr Thr Thr Ala Thr 
80 85 90 

GTT TTG ACA CAA GCC ATT GTT CAT GAA GGA CTA AAA AAT GTG ACA GCA 33 8 

Val Leu Thr Gin Ala He Val His Glu Gly Leu Lys Asn Val Thr Ala 
95 100 105 

GGT GCT AAT CCA ATT GGT ATC CGT CGA GGC ATT GAA ACA GCA ACA GCA 386 
Gly Ala Asn Pro He Gly He Arg Arg Gly He Glu Thr Ala Thr Ala 
110 115 120 

ACA GCT GTT GAA GCC TTG AAA GCC ATT GCT CAA CCT GTA TCT GGC AAG 4 34 

Thr Ala Val Glu Ala Leu Lys Ala He Ala Gin Pro Val Ser Gly Lys 
125 130 135 140 

GAA GCT ATT GCT CAG GTC GCT GCA GTA TCA TCA CGC TCT GAA AAA GTT 4 82 

Glu Ala He Ala Gin Val Ala Ala Val Ser Ser Arg Ser Glu Lys Val 
145 150 155 

GGA GAG TAT ATC TCA GAA GCT ATG GAG CGT GTG GGC AAC GAT GGT GTG 53 0 

Gly Glu Tyr He Ser Glu Ala Met Glu Arg Val Gly Asn Asp Gly Val 
160 165 170 

ATT ACC ATC GAA GAA TCT CGA GGT ATG GAA ACA GAA CTT GAA GTG GTT 578 
He Thr He Glu Glu Ser Arg Gly Met Glu Thr Glu Leu Glu Val Val 
175 180 185 

GAA GGC ATG CAA TTT GAC CGT GGT TAG CTG TCT CAA TAC ATG GTC ACA 62 6 

Glu Gly Met Gin Phe Asp Arg Gly Tyr Leu Ser Gin Tyr Met Val Thr 
190 195 200 

GAC AAT GAA AAA ATG GTT GCA GAC CTT GAA AAC CCA TTT ATC TTA ATC 674 
Asp Asn Glu Lys Met Val Ala Asp Leu Glu Asn Pro Phe He Leu He 
205 210 215 220 

ACG GAT AAA AAA GTG TCA AAC ATC CAA GAC ATT TTG CCA CTA CTT GAG 722 
Thr Asp Lys Lys Val Ser Asn lie Gin Asp He Leu Pro Leu Leu Glu 
225 230 235 

GAA GTT CTT AAA ACC AAC CGT CCA TTA CTC ATT ATT GCA GAT GAT GTG 770 
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Glu Val Leu Lys Thr Asn Arg Pro Leu Leu lie lie Ala Asp Asp Val 
240 245 250 

GAT GGT GAA GCA CTT CCA ACC CTT GTC TTG AAC AAG ATT CGT GGT ACT 818 
Asp Gly Glu Ala Leu Pro Thr Leu Val Leu Asn Lys lie Arg Gly Thr 
255 260 265 

TTC AAT GTG GTT GCT GTC AAA GCG CCA GGA TTT GGT GAT CGT CGT AAA 866 
Phe Asn Val Val Ala Val Lys Ala Pro Gly Phe Gly Asp Arg Arg Lys 
270 275 280 

GCT ATG CTT GAA GAC ATT GCT ATC TTG ACA GGT GGT ACA GTG ATT ACA 914 
Ala Met Leu Glu Asp lie Ala lie Leu Thr Gly Gly Thr Val lie Thr 
285 290 295 300 

GAG GAT CTA GGA CTT GAA TTA AAA GAT GCT ACA ATG ACA GCC CTT GGA 962 
Glu Asp Leu Gly Leu Glu Leu Lys Asp Ala Thr Met Thr Ala Leu Gly 
305 310 315 

CAG GCT GCT AAG ATT ACA GTT GAT AAA GAT AGC ACA GTA ATT GTT GAA 1010 
Gin Ala Ala Lys lie Thr Val Asp Lys Asp Ser Thr Val lie Val Glu 
320 325 330 

GGT TCA GGA AGT TCA GAA GCT ATT GCT AAC CGT ATT GCA CTG ATT AAA 1058 
Gly Ser Gly Ser Ser Glu Ala lie Ala Asn Arg lie Ala Leu lie Lys 
335 340 345 

TCG CAA TTA GAA ACA ACA ACT TCT GAC TTT GAC CGT GAA AAA CTA CAA 1106 
Ser Gin Leu Glu Thr Thr Thr Ser Asp Phe Asp Arg Glu Lys Leu Gin 
350 . 355 360 

GAA CGT TTG GCG AAA TTA GCT GGT GGT GTA GCT GTT ATC AAA GTA GGA 1154 
Glu Arg Leu Ala Lys Leu Ala Gly Gly Val Ala Val lie Lys Val Gly 
365 370 375 380 

GCT CCA ACA GAG ACA GCT TTA AAA GAA ATG AAA CTT CGC ATT GAG GAT 12 02 

Ala Pro Thr Glu Thr Ala Leu Lys Glu Met Lys Leu Arg lie Glu Asp 
385 390 395 

GCT CTA AAT GCT ACA CGT GCA GCC GTT GAA GAA GGT ATC GTT GCT GGT 1250 
Ala Leu Asn Ala Thr Arg Ala Ala Val Glu Glu Gly lie Val Ala Gly 
400 405* 410 

GGT GGA ACA GCA CTT ATT ACG GTT ATT GAA AAA GTA GCA GCT CTT GAG 12 98 

Gly Gly Thr Ala Leu lie Thr Val lie Glu Lys Val Ala Ala Leu Glu 
415 420 425 

CTT GAG GGC GAT GAT GCT ACT GGA CGT AAC ATT GTG CTT CGT GCT CTA 1346 
Leu Glu Gly Asp Asp Ala Thr Gly Arg Asn lie Val Leu Arg Ala Leu 
430 435 440 

GAA GAG CCT GTA CGT CAA ATT GCT TTA AAT GCT GGG TAC GAA GGC TCC 1394 
Glu Glu Pro Val Arg Gin lie Ala Leu Asn Ala Gly Tyr Glu Gly Ser 
445 450 455 460 
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GTA GTT ATT GAC AAG TTG AAA AAC AGC CCT GCA GGA ACA GGA TTT AAT 144 2 

Val Val lie Asp Lys Leu Lys Asn Ser Pro Ala Gly Thr Gly Phe Asn 
465 470 475 

GCT GCA ACA GGT GAG TGG GTT GAT ATG ATT AAA ACA GGA ATC ATT GAC 14 9 0 

Ala Ala Thr Gly Glu Trp Val Asp Met lie Lys Thr Gly lie lie Asp 
480 485 490 

CCT GTC AAA GTA ACA CGA TCA GCG CTT CAA AAT GCA GCT TCT GTA GCT 153 8 

Pro Val Lys Val Thr Arg Ser Ala Leu Gin Asn Ala Ala Ser Val Ala 
495 500 505 

AGT CTT ATT TTG ACA ACA GAA GCA GTT GTT GCT AAT AAA CCT GAA CCA 1586 
Ser Leu lie Leu Thr Thr Glu Ala Val Val Ala Asn Lys Pro Glu Pro 
510 515 520 

GCT ACG CCA GCG CCA GCA ATG CCA GCA GGT ATG GAT CCA GGA ATG ATG 16 34 

Ala Thr Pro Ala Pro Ala Met Pro Ala Gly Met Asp Pro Gly Met Met 
525 530 535 540 

GGT GGG ATG GGC GGA TAAGCCGAAT TC 16 61 

Gly Gly Met Gly Gly 
545 



(2) INFORMATION FOR SEQ ID NO : 8 : 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 54 5 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:8: 

Met Ala Lys Glu lie Lys Phe Ser Ala Asp Ala Arg Ala Ala Met Val 
15 10 15 

Arg Gly Val Asp Met Leu Ala Asp Thr Val Lys Val Thr Leu Gly Pro 
20 25 30 

Lys Gly Arg Asn Val Val Leu Glu Lys Ala Phe Gly Ser Pro Leu lie 
35 40 45 

Thr Asn Asp Gly Val Thr lie Ala Lys Glu lie Glu Leu Glu Asp His 
50 55 60 

Phe Glu Asn Met Gly Ala Lys Leu Val Ser Glu Val Ala Ser Lys Thr 
65 70 75 80 

Asn Asp lie Ala Gly Asp Gly Thr Thr Thr Ala Thr Val Leu Thr Gin 
85 90 95 

Ala lie Val His Glu Gly Leu Lys Asn Val Thr Ala Gly Ala Asn Pro 
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100 105 110 

lie Gly lie Arg Arg Gly lie Glu Thr Ala Thr Ala Thr Ala Val Glu 
115 120 125 

Ala Leu Lys Ala lie Ala Gin Pro Val Ser Gly Lys Glu Ala lie Ala 
130 135 140 

Gin Val Ala Ala Val Ser Ser Arg Ser Glu Lys Val Gly Glu Tyr lie 
145 150 155 160 

Ser Glu Ala Met Glu Arg Val Gly Asn Asp Gly Val He Thr He Glu 
165 170 175 

Glu Ser Arg Gly Met Glu Thr Glu Leu Glu Val Val Glu Gly Met Gin 
180 185 190 

Phe Asp Arg Gly Tyr Leu Ser Gin Tyr Met Val Thr Asp Asn Glu Lys 
195 200 205 

Met Val Ala Asp Leu Glu Asn Pro Phe He Leu He Thr Asp Lys Lys 
210 215 220 

Val Ser Asn He Gin Asp He Leu Pro Leu Leu Glu Glu Val Leu Lys 
225 230 235 240 

Thr Asn Arg Pro Leu Leu He He Ala Asp Asp Val Asp Gly Glu Ala 
245 250 255 

Leu Pro Thr Leu Val Leu Asn Lys He Arg Gly Thr Phe Asn Val Val 
260 265 270 

Ala Val Lys Ala Pro Gly Phe Gly Asp Arg Arg Lys Ala Met Leu Glu 
275 280 285 

Asp He Ala He Leu Thr Gly Gly Thr Val He Thr Glu Asp Leu Gly 
290 295 300 

Leu Glu Leu Lys Asp Ala Thr Met Thr Ala Leu Gly Gin Ala Ala Lys 
305 310 315 320 

He Thr Val Asp Lys Asp Ser Thr Val He Val Glu Gly Ser Gly Ser 
325 330 335 

Ser Glu Ala He Ala Asn Arg He Ala Leu He Lys Ser Gin Leu Glu 
340 345 350 

Thr Thr Thr Ser Asp Phe Asp Arg Glu Lys Leu Gin Glu Arg Leu Ala 
355 360 365 

Lys Leu Ala Gly Gly Val Ala Val He Lys Val Gly Ala Pro Thr Glu 
370 375 380 

Thr Ala Leu$Lys Glu Met Lys Leu Arg He Glu Asp Ala Leu Asn Ala 
385 390 395 400 
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Thr Arg Ala Ala Val Glu Glu Gly lie Val Ala Gly Gly Gly Thr Ala 
405 410 415 

Leu lie Thr Val lie Glu Lys Val Ala Ala Leu Glu Leu Glu Gly Asp 
420 425 430 

Asp Ala Thr Gly Arg Asn lie Val Leu Arg Ala Leu Glu Glu Pro Val 
435 440 445 

Arg Gin lie Ala Leu Asn Ala Gly Tyr Glu Gly Ser Val Val lie Asp 
450 455 460 

Lys Leu Lys Asn Ser Pro Ala Gly Thr Gly Phe Asn Ala Ala Thr Gly 
465 470 475 480 

Glu Trp Val Asp Met lie Lys Thr Gly lie lie Asp Pro Val Lys Val 
485 490 495 

Thr Arg Ser Ala Leu Gin Asn Ala Ala Ser Val Ala Ser Leu He Leu 
500 505 510 

Thr Thr Glu Ala Val Val Ala Asn Lys Pro Glu Pro Ala Thr Pro Ala 
515 520 525 

Pro Ala Met Pro Ala Gly Met Asp Pro Gly Met Met Gly Gly Met Gly 
530 535 540 



Gly 
545 

(2) INFORMATION FOR SEQ ID NO : 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 544 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

( D ) TOPOLOGY : 1 inear 



(xi) SEQUENCE DESCRIPTION: SEQ II 

Met Ala Lys Glu He Lys Phe Ser 
1 5 

Arg Gly Val Asp Ala Leu Ala Asp 
20 

Lys Gly Arg Asn Val Val Leu Glu 
35 40 

Thr Asn Asp Gly Val Thr He Ala 



l NO: 9: 

Glu Glu Ala Arg Arg Ala Met Leu 
10 15 

Ala Val Lys Val Thr Leu Gly Pro 
25 30 

Lys Lys Phe Gly Ser Pro Leu He 
45 

Lys Glu He Glu Leu Glu Asp Ala 
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50 55 60 

Phe Glu Asn Met Gly Ala Lys Leu Val Ala Glu Val Ala Ser Lys Thr 
65 70 75 80 

Asn Asp Val Ala Gly Asp Gly Thr Thr Thr Ala Thr Val Leu Ala Gin 
85 90 95 

Ala Met lie Arg Glu Gly Leu Lys Asn Val Thr Ala Gly Ala Asn Pro 
100 105 110 

Val Gly Val Arg Lys Gly Met Glu Gin Ala Val Ala Val Ala lie Glu 
115 120 125 

Asn Leu Lys Glu lie Ser Lys Pro lie Glu Gly Lys Glu Ser lie Ala 
130 135 140 

Gin Val Ala Ala lie Ser Ala Ala Asp Glu Glu Val Gly Ser Leu lie 
145 150 155 160 

Ala Glu Ala Met Glu Arg Val Gly Asn Asp Gly Val lie Thr lie Glu 
165 170 175 

Glu Ser Lys Gly Phe Thr Thr Glu Leu Glu Val Val Glu Gly Met Gin 
180 185 190 

Phe Asp Arg Gly Tyr Ala Ser Pro Tyr Met Val Thr Asp Ser Asp Lys 
195 200 205 

Met Glu Ala Val Leu Asp Asn Pro Tyr lie Leu lie Thr Asp Lys Lys 
210 215 220 

He Thr Asn He Gin Glu He Leu Pro Val Leu Glu Gin Val Val Gin 
225 230 235 240 

Gin Gly Lys Pro Leu Leu Leu He Ala Glu Asp Val Glu Gly Glu Ala 
245 250 255 

Leu Ala Thr Leu Val Val Asn Lys Leu Arg Gly Thr Phe Asn Ala Val 
260 265 270 

Ala Val Lys Ala Pro Gly Phe Gly Asp Arg Arg Lys Ala Met Leu Glu 
275 280 285 

Asp He Ala Val Leu Thr Gly Gly Glu Val He Thr Glu Asp Leu Gly 
290 295 300 

Leu Asp Leu Lys Ser Thr Gin He Ala Gin Leu Gly Arg Ala Ser Lys 
305 310 315 320 

Val Val Val Thr Lys Glu Asn Thr Thr He Val Glu Gly Ala Gly Glu 
325 330 335 

Thr Asp Lys He Ser Ala Arg Val Thr Gin He Arg Ala Gin Val Glu 
340 345 350 
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Glu Thr Thr Ser Glu Phe Asp Arg Glu Lys Leu Gin Glu Arg Leu Ala 
355 360 365 

Lys Leu Ala Gly Gly Val Ala Val lie Lys Val Gly Ala Ala Thr Glu 
370 375 380 

Thr Glu Leu Lys Glu Arg Lys Leu Arg lie Glu Asp Ala Leu Asn Ser 
385 390 395 400 

Thr Arg Ala Ala Val Glu Glu Gly lie Val Ser Gly Gly Gly Thr Ala 
405 410 415 

Leu Val Asn Val Tyr Asn Lys Val Ala Ala Val Glu Ala Glu Gly Asp 
420 425 430 

Ala Gin Thr Gly lie Asn lie Val Leu Arg Ala Leu Glu Glu Pro lie 
435 440 445 

Arg Gin lie Ala His Asn Ala Gly Leu Glu Gly Ser Val lie Val Glu 
450 455 460 

Arg Leu Lys Asn Glu Glu lie Gly Val Gly Phe Asn Ala Ala Thr Gly 
465 470 475 480 

Glu Trp Val Asn Met lie Glu Lys Gly lie Val Asp Pro Thr Lys Val 
485 490 495 

Thr Arg Ser Ala Leu Gin Asn Ala Ala Ser Val Ala Ala Met Phe Leu 
500 505 510 

Thr Thr Glu Ala Val Val Ala Asp Lys Pro Glu Glu Asn Gly Gly Gly 
515 520 525 

Ala Gly Met Pro Asp Met Gly Gly Met Gly Gly Met Gly Gly Met Met 
530 535 540 



(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 53 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 



Met 
1 



Ala Lys Thr Leu Leu Phe Gly Glu Glu Ala Arg Arg Ser Met Gin 
5 10 15 
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Ala Gly Val Asp Lys Leu Ala Asn Thr Val Lys Val Thr Leu Gly Pro 
20 25 30 

Lys Gly Arg Asn Val lie Leu Asp Lys Lys Phe Gly Ser Pro Leu lie 
35 40 45 

Thr Asn Asp Gly Val Thr lie Ala Arg Glu lie Glu Leu Glu Asp Ala 
50 55 60 

Tyr Glu Asn Met Gly Ala Gin Leu Val Lys Glu Val Ala Thr Lys Thr 
65 70 75 80 

Asn Asp Val Ala Gly Asp Gly Thr Thr Thr Ala Thr Leu Leu Ala Gin 
85 90 95 

Ala lie lie Arg Glu Gly Leu Lys Asn Val Thr Ala Gly Ala Asn Pro 
100 105 110 

He Leu He Arg Asn Gly He Lys Thr Ala Val Glu Lys Ala Val Glu 
115 120 . 125 

Glu He Gin Lys He Ser Lys Pro Val Asn Gly Lys Glu Asp He Ala 
130 135 140 

Arg Val Ala Ala He Ser Ala Ala Asp Glu Lys He Gly Lys Leu He 
145 • 150 155 160 

Ala Asp Ala Met Glu Lys Val Gly Asn Glu Gly Val He Thr Val Glu 
165 170 175 

Glu Ser Lys Ser Met Gly Thr Glu Leu Asp Val Val Glu Gly Met Gin 
180 185 190 

Phe Asp Arg Gly Tyr Val Ser Ala Tyr Met Val Thr Asp Thr Glu Lys 
195 200 205 

Met Glu Ala Val Leu Asp Asn Pro Leu Val Leu He Thr Asp Lys Lys 
210 215 220 

He Ser Asn lie Gin Asp Leu Leu Pro Leu Leu Glu Gin He Val Gin 
225 230 235 240 

Ala Gly Lys Lys Leu Leu He He Ala Asp Asp lie Glu Gly Glu Ala 
245 250 255 

Met Thr Thr Leu Val Val Asn Lys Leu Arg Gly Thr Phe Thr Cys Val 
260 265 270 

Gly Val Lys Ala Pro Gly Phe Gly Asp Arg Arg Lys Glu Met Leu Gin 
275 280 285 

Asp He Ala Thr Leu Thr Gly Gly Val Val He Ser Asp Glu Val Gly 
290 295 300 



Gly Asp Leu Lys Glu Ala Thr Leu Asp Met Leu Gly Glu Ala Glu Ser 
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305 310 315 320 

Val Lys Val Thr Lys Glu Ser Thr Thr lie Val Asn Gly Arg Gly Asn 
325 330 335 

Ser Glu Glu lie Lys Asn Arg lie Asn Gin lie Lys Leu Gin Leu Glu 
340 345 350 

Ala Thr Thr Ser Glu Phe Asp Lys Glu Lys Leu Gin Glu Arg Leu Ala 
355 360 365 

Lys Leu Ala Gly Gly Val Ala Val Val Lys Val Gly Ala Ala Thr Glu 
370 375 380 

Thr Glu Leu Lys Glu Ser Lys Leu Arg lie Glu Asp Ala Leu Ala Ala 
385 390 395 400 

Thr Lys Ala Ala Val Glu Glu Gly lie Val Pro Gly Gly Gly Thr Ala 
405 410 415 

Tyr Val Asn Val lie Asn Glu Val Ala Lys Leu Thr Ser Asp lie Gin 
420 425 430 

Asp Glu Gin Val Gly lie Asn He He Val Arg Ser Leu Glu Glu Pro 
435 440 445 

Met Arg Gin He Ala His Asn Ala Gly Leu Glu Gly Ser Val He He 
450 455 460 

Glu Lys Val Lys Asn Ser Asp Ala Gly Val Gly Phe Asp Ala Leu Arg 
465 470 475 480 

Gly Glu Tyr Lys Asp Met He Lys Ala Gly He Val Asp Pro Thr Lys 
485 490 495 

Val Thr Arg Ser Ala Leu Gin Asn Ala Ala Ser Val Ala Ser Thr Phe 
500 505 510 

Leu Thr Thr Glu Ala Ala Val Ala Asp He Pro Glu Lys Glu Met Pro 
515 520 525 

Gin Gly Ala Gly Met Gly Met Asp Gly Met Tyr 
530 535 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 551 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO; 11: 

Met Ala Asn Met Val Val Thr Gly Glu Gin Leu Asp Lys Ser lie Arg 
15 10 15 

Glu Val Val Arg lie Leu Glu Asp Ala Val Gly Cys Thr Ala Gly Pro 
20 25 30 

Lys Gly Leu Thr Val Ala lie Ser Lys Pro Tyr Gly Ala Pro Glu Val 
35 40 45 

Thr Lys Asp Gly Tyr Lys Val Met Lys Ser lie Lys Pro Glu Asp Pro 
50 55 60 

Leu Ala Leu Ala lie Ala Asn lie lie Ala Gin Ser Ala Ser Gin Cys 
65 70 75 80 

Asn Asp Lys Val Gly Asp Gly Thr Thr Thr Cys Ser lie Leu Thr Ala 
85 90 95 

Lys Val lie Glu Glu Val Ser Lys Val Lys Ala Ala Gly Ala Asp lie 
100 105 110 

lie Cys Val Arg Glu Gly Val Leu Lys Ala Lys Glu Ala Val Leu Glu 
115 120 125 

Ala Leu Lys Cys Met Lys Arg Glu Val Leu Ser Glu Glu Glu lie Ala 
130 135 140 

Gin .Val Ala Thr He Ser Ala Asn Gly Asp Lys Asn He Gly Thr Lys 
145 150 155 160 

He Ala Gin Cys Val Lys Glu Val Gly Lys Asp Gly Val He Thr Val 
165 170 175 

Glu Glu Ser Lys Gly Phe Lys Glu Leu Asp Val Glu Lys Thr Asp Gly 
180 185 190 

Met Gin Phe Asp Arg Gly Tyr Leu Ser Pro Tyr Phe Val Thr Asn Ser 
195 200 205 

Glu Lys Met Leu Val Glu Phe Glu Asn Pro Tyr He Leu Leu Thr Glu 
210 215 220 

Lys Lys Leu Asn He He Gin Pro Leu Leu Pro He Leu Glu Asn He 
225 230 235 240 

Ala Arg Ser Gly Arg Pro Leu Leu He He Ala Glu Asp Val Glu Gly 
245 250 255 

Glu Ala Leu Ser Thr Leu Val Leu Asn Lys Leu Arg Gly Gly Leu His 
260 265 270 



Val Ala Ala Val Lys Ala Pro Gly Phe Gly Asp Arg Arg Lys Asp Met 
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275 280 285 

Leu Gly Asp lie Ala lie Leu Thr Gly Ala Lys His Val lie Asn Asp 
290 295 300 

Glu Leu Ala lie Lys Met Glu Asp Leu Thr Leu Cys Asp Leu Gly Thr 
305 310 315 320 

Ala Lys Asn lie Arg lie Thr Lys Asp Thr Thr Thr lie lie Gly Ser 
325 330 335 

Val Asp Asn Ser Cys Ala His Val Gin Ser Arg lie Cys Gin lie Arg 
340 345 350 

Met Gin lie Asp Asn Ser Thr Ser Asp Tyr Asp Lys Glu Lys Leu Gin 
355 360 365 

Glu Arg Leu Ala Lys Leu Ser Gly Gly Val Ala Val Leu Lys Val Gly 
370 375 380 

Gly Ser Ser Glu Val Glu Val Lys Glu Arg Lys Asp Arg Val Glu Asp 
385 390 395 400 

Ala Leu His Ala Thr Arg Ala Ala Val Glu Glu Gly Val Val Pro Gly 
405 410 415 

Gly Gly Ala Ala Leu Leu Tyr Thr Leu Ser Ala Leu Asp Asn Leu Lys 
420 425 430 

Ser Lys Asn Asp Asp Glu Gin Leu Gly lie Asn lie Val Lys Arg Ala 
435 440 445 

Leu Gin Ala Pro lie Lys Arg lie lie Lys Asn Ala Gly Ser Glu Asn 
450 455 460 

Ala Pro Cys Val lie Ala His Leu Leu Lys Gin Asn Asp Lys Glu Leu 
465 470 475 480 

lie Phe Asn Val Asp Val Thr Asn Phe Ala Asn Ala Phe Thr Ser Gly 
485 490 495 

Val lie Asp Pro Leu Lys Val Val Arg lie Ala Phe Asp Phe Ala Val 
500 505 510 

Ser Leu Ala Ala Val Phe Met Thr Leu Asn Ala lie Val Val Asp lie 
515 520 525 

Pro Ser Lys Asp Asp Asn Ser Ala Ala Gly Gly Ala Gly Met Gly Gly 
530 535 540 

Met Gly Gly Met Gly Gly Phe 
545 550 

(2) INFORMATION FOR SEQ ID NO: 12: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 54 8 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Met Ala Ala Lys Asp Val Lys Phe Gly Asn Asp Ala Arg Val Lys Met 
15 10 15 

Leu Asn Gly Val Asn lie Leu Ala Asp Ala Val Lys Val Thr Leu Gly 
20 25 30 

Pro Lys Gly Arg Asn Val Val Leu Asp Lys Ser Phe Gly Ala Pro Thr 
35 40 45 

lie Thr Lys Asp Gly Val Ser Val Ala Arg Glu lie Glu Leu Glu Asp 
50 55 60 

Lys Phe Glu Asn Met Gly Ala Gin Met Val Lys Glu Val Ala Ser Lys 
65 70 75 80 

Ala Asn Asp Ala Ala Gly Asp Gly Thr Thr Thr Ala Thr Val Leu Ala 
85 90 95 

Gin Ala lie Val Asn Glu Gly Leu Lys Ala Val Ala Ala Gly Met Asn 
100 105 110 

Pro Met Asp Leu Lys Arg Gly lie Asp Lys Ala Val Asn Ser Val Val 
115 120 125 

Ala Glu Leu Lys Asn Leu Ser Lys Pro Cys Glu Thr Ser Lys Glu lie 
130 135 140 

Glu Gin Val Gly Thr lie Ser Ala Asn Ser Asp Ser lie Val Gly Gin 
145 150 155 160 

Leu lie Ala Gin Ala Met Glu Lys Val Gly Lys Glu Gly Val lie Thr 
165 170 175 

Val Glu Asp Gly Thr Gly Leu Glu Asp Glu Leu Asp Val Val Glu Gly 
180 185 190 

Met Gin Phe Asp Arg Gly Tyr Leu Ser Pro Tyr Phe lie Asn Lys Pro 
195 200 205 

Glu Thr Ala Gly Thr Val Glu Leu Asp Asn Pro Phe lie Leu Leu Val 
210 215 220 



Asp Lys Lys lie Ser Asn lie Arg Glu Leu Leu Pro Val Leu Glu Ala 



WO 99/35270 



29 



PCT/CA98/01203 



225 230 235 240 

Val Ala Lys Ala Gly Lys Pro Leu Leu lie lie Ala Glu Asp Val Glu 
245 250 255 

Gly Glu Ala Leu Ala Thr Leu Val Val Asn Thr Met Arg Gly lie Val 
260 265 270 

Lys Val Ala Ala Val Lys Ala Pro Gly Phe Gly Asp Arg Arg Lys Ala 
275 280 285 

Met Leu Gin Asp He Ala He Leu Thr Ala Gly Thr Val He Ser Glu 
290 295 300 

Glu He Gly Met Glu Leu Glu Lys Ala Thr Leu Glu Glu Leu Gly Gin 
305 310 315 320 

Ala Lys Arg Val Val He Thr Lys Asp Asn Thr Thr lie He Asp Gly 
325 330 335 

He Gly Asp Glu Ala Gin He Lys Ala Arg Val Val Gin He Arg Gin 
340 345 350 

Gin He Glu Asp Ser Thr Ser Asp Tyr Asp Lys Glu Lys Leu Gin Glu 
355 360 365 

Arg Val Ala Lys Leu Ala Gly Gly Val Ala Val He Lys Val Gly Ala 
370 375 380 

Ala Thr Glu Val Ala Met Lys Glu Lys Lys Asp Arg Val Asp Asp Ala 
385 390 395 400 

Leu His Ala Thr Arg Ala Ala- Val Glu Glu Gly He Val Pro Gly Gly 
405 410 415 

Gly Val Ala Leu Val Arg Ala Ala Asn Lys Val Ser Ala Thr Leu Thr 
420 425 430 

Gly Asp Asn Glu Glu Gin Asn Val Gly He Lys Leu Ala Leu Arg Ala 
435 440 445 

Met Glu Ala Pro Leu Arg Gin He Val Glu Asn Ser Gly Glu Asp Ala 
450 455 460 

Ser Val Val Ala Arg Asp Val Lys Asp Gly Ser Gly Asn Phe Gly Tyr 
465 470 475 480 

Asn Ala Thr Thr Glu Glu Tyr Gly Asp Met Leu Glu Met Gly He Leu 
485 490 495 

Asp Pro Thr Lys Val Thr Arg Ser Ala Leu Gin Phe Ala Ala Ser He 
500 505 510 

Ala Gly Leu Met He Thr Thr Glu Cys Met He Thr Asp Leu Pro Lys 
515 520 525 
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Glu Asp Lys Leu Asp Ala Gin Ala Ala Met Gly Gly Met Gly Gly Met 
530 535 540 

Gly Gly Met Met 



(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 549 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

( D ) TOPOLOGY : 1 inear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

Met Ala Lys Glu Leu Arg Phe Gly Asp Asp Ala Arg Leu Gin Met Leu 
15 10 15 

Ala Gly Val Asn Ala Leu Ala Asp Ala Val Gin Val Thr Met Gly Pro 
20 25 30 

Arg Gly Arg Asn Val Val Leu Glu Lys Ser Tyr Gly Ala Pro Thr Val 
35 40 45 

Thr Lys Asp Gly Val Ser Val Ala Lys Glu lie Glu Phe Glu His Arg 
50 55 60 

Phe Met Asn Met Gly Ala Gin Met Val Lys Glu Val Ala Ser Lys Thr 
65 70 75 80 

Ser Asp Thr Ala Gly Asp Gly Thr Thr Thr Ala Thr Val Leu Ala Arg 
85 90 95 

Ser lie Leu Val Glu Gly His Lys Ala Val Ala Ala Gly Met Asn Pro 
100 105 110 

Met Asp Leu Lys Arg Gly lie Asp Lys Ala Val Leu Ala Val Thr Lys 
115 120 125 

Lys Leu Gin Ala Met Ser Lys Pro Cys Lys Asp Ser Lys Ala lie Ala 
130 135 140 

Gin Val Gly Thr lie Ser Ala Asn Ser Asp Glu Ala lie Gly Ala lie 
145 150 155 160 

lie Ala Glu Ala Met Glu Lys Val Gly Lys Glu Gly Val lie Thr Val 
165 170 175 

Glu Asp Gly Asn Gly Leu Glu Asn Glu Leu Ser Val Val Glu Gly Met 
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180 



185 



190 



Gin Phe Asp Arg 
195 

Asn Met Ser Cys 
210 

Lys Val Ser Ser 
225 

Lys Ser Gly Arg 



Ala Leu Ala Thr 
260 



Gly Tyr lie Ser 
200 

Glu Leu Glu His 
215 

lie Arg Glu Met 
230 

Pro Leu Leu lie 
245 

Leu Val Val Asn 



Pro Tyr Phe lie 



Pro Phe lie Leu 
220 

Leu Ser Val Leu 
235 

lie Ala Glu Asp 
250 

Asn Met Arg Gly 
265 



Asn Asn Gin Gin 
205 

Leu Val Asp Lys 



Glu Gly Val Ala 
240 

Val Glu Gly Glu 
255 

lie Val Lys Val 
270 



Cys Ala Val Lys Ala Pro Gly Phe Gly Asp Arg Arg Lys Ala Met Leu 
275 280 285 

Gin Asp lie Ala lie Leu Thr Lys Gly Gin Val lie Ser Glu Glu lie 
290 295 300 

Gly Lys Ser Leu Glu Gly Ala Thr Leu Glu Asp Leu Gly Ser Ala Lys 
305 310 315 320 

Arg lie Val Val Thr Lys Glu Asn Thr Thr lie lie Asp Gly Glu Gly 
325 330 335 

Lys Ala Thr Glu He Asn Ala Arg He Thr Gin He Arg Ala Gin Met 
340 345 350 

Glu Glu Thr Thr Ser Asp Tyr Asp Arg Glu Lys Leu Gin Glu Arg Val 
355 360 365 

Ala Lys Leu Ala Gly Gly Val Ala Val He Lys Val Gly Ala Ala Thr 
370 375 380 

Glu Val Glu Met Lys Glu Lys Lys Ala Arg Val Glu Asp Ala Leu His 
385 390 395 400 

Ala Thr Arg Ala Ala Val Glu Glu Gly He Val Ala Gly Gly Gly Val 
405 410 415 

Ala Leu He Arg Ala Gin Lys Ala Leu Asp Ser Leu Lys Gly Asp Asn 
420 425 430 

Asp Asp Gin Asn Met Gly He Asn He Leu Arg Arg Ala He Glu Ser 
435 440 445 

Pro Met Arg Gin He Val Thr Asn Ala Gly Tyr Glu Ala Ser Val Val 
450 455 460 



Val Asn Lys Val Ala Glu His Lys Asp Asn Tyr Gly Phe Asn Ala Ala 
465 470 475 480 
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Thr Gly Glu Tyr Gly Asp Met Val Glu Met Gly lie Leu Asp Pro Thr 
485 ' 490 495 

Lys Val Thr Arg Met Ala Leu Gin Asn Ala Ala Ser Val Ala Ser Leu 
500 505 510 

Met Leu Thr Thr Glu Cys Met Val Ala Asp Leu Pro Lys Lys Glu Glu 
515 520 525 

Gly Val Gly Ala Gly Asp Met Gly Gly Met Gly Gly Met Gly Gly Met 
530 535 540 

Gly Gly Met Met Glx 
545 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 541 amino acids 

(B) TYPE: amino acid 

( C ) STRANDEDNESS : 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:14: 

Met Ala Lys Thr lie Ala Tyr Asp Glu Glu Ala Arg Arg Gly Leu Glu 
15 10 15 

Arg Gly Leu Asn Ala Leu Ala Asp Ala Val Lys Val Thr Leu Gly Pro 
20 25 30 

Lys Gly Arg Asn Val Val Leu Glu Lys Lys Trp Gly Ala Pro Thr lie 
35 40 45 

Thr Asn Asp Gly Val Ser lie Ala Lys Glu lie Glu Leu Glu Asp Pro 
50 55 60 

Tyr Glu Lys lie Gly Ala Glu Leu Val Lys Glu Val Ala Lys Lys Thr 
65 70 75 80 

Asp Asp Val Ala Gly Asp Gly Thr Thr Thr Ala Thr Val Leu Ala Gin 
85 90 95 

Ala Leu Val Arg Glu Gly Leu Arg Asn Val Ala Ala Gly Ala Asn Pro 
100 105 110 

Leu Gly Leu Lys Arg Gly lie Glu Lys Ala Val Glu Lys Val Thr Glu 
115 120 125 

Thr Leu Leu Lys Ser Ala Lys Glu Val Glu Thr Lys Asp Gin lie Ala 
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130 135 140 

Ala Thr Ala Ala He Ser Ala Gly Asp Gin Ser He Gly Asp Leu He 
145 150 155 160 

Ala Glu Ala Met Asp Lys Val Gly Asn Glu Gly Val He Thr Val Glu 
165 170 175 

Glu Ser Asn Thr Phe Gly Leu Gin Leu Glu Leu Thr Glu Gly Met Arg 
180 185 190 

Phe Asp Lys Gly Tyr He Ser Gly Tyr Phe Val Thr Asp Ala Glu Arg 
195 200 205 

Gin Glu Ala Val Leu Glu Asp Pro Phe He Leu Leu Val Ser Ser Lys 
210 215 220 

Val Ser Thr Val Lys Asp Leu Leu Pro Leu Leu Glu Lys Val He Gin 
225 230 235 240 

Ala Gly Lys Pro Leu Leu He He Ala Glu Asp Val Glu Gly Glu Ala 
245 250 255 

Leu Ser Thr Leu Val Val Asn Lys He Arg Gly Thr Phe Lys Ser Val 
260 265 270 

Ala Val Lys Ala Pro Gly Phe Gly Asp Arg Arg Lys Ala Met Leu Gin 
275 280 285 

Asp Met Ala He Leu Thr Gly Gly Gin Val He Ser Glu Glu Val Gly 
290 295 300 

Leu Ser Leu Glu Ser Ala Asp He Ser Leu Leu Gly Lys Ala Arg Lys 
305 310 315 ' 320 

Val Val Val Thr Lys Asp Glu Thr Thr He Val Glu Gly Ala Gly Asp 
325 330 335 

Ser Asp Ala He Ala Gly Arg Val Ala Gin He Arg Thr Glu He Glu 
340 345 350 

Asn Ser Asp Ser Asp Tyr Asp Arg Glu Lys Leu Gin Glu Arg Leu Ala 
355 360 365 

Lys Leu Ala Gly Gly Val Ala Val He Lys Ala Gly Ala Ala Thr Glu 
^ 370 375 380 

Val Glu Leu Lys Glu Arg Lys His Arg He Glu Asp Ala Val Arg Asn 
385 390 395 400 

Ala Lys Ala Ala Val Glu Glu Gly He Val Ala Gly Gly Gly Val Ala 
405 410 415 



Leu Leu His Ala He Pro Ala Leu Asp Glu Leu Lys Pro Glu Gly Glu 
420 425 430 
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Glu Ala Thr Gly Ala Asn lie Val Arg Val Ala Leu Glu Arg Pro Leu 
435 440 445 

Lys Gin lie Ala Phe Asn Gly Gly Leu Glu Pro Gly Val Val Ala Glu 
450 455 460 

Lys Val Arg Asn Ser Pro Ala Gly Thr Gly Leu Asn Ala Ala Thr Gly 
465 470 475 480 

Glu Tyr Glu Asp Leu Leu Lys Ala Gly lie Ala Asp Pro Val Lys Val 
485 490 495 

Thr Arg Ser Ala Leu Gin Asn Ala Ala Ser lie Ala Gly Leu Phe Leu 
500 505 510 

Thr Thr Glu Ala Val Val Ala Asp Lys Pro Glu Lys Ala Ala Ala Pro 
515 520 525 

Ala Gly Asp Pro Thr Gly Gly Met Gly Gly Met Asp Phe 
530 535 540 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 54 0 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

Met Ala Lys Thr lie Ala Tyr Asp Glu Glu Ala Arg Arg Gly Leu Glu 

15 10 15 

Arg Gly Leu Asn Ala Leu Ala Asp Ala Val Lys Val Thr Leu Gly Pro 
20 25 30 

Lys Gly Arg Asn Val Val Leu Glu Lys Lys Trp Gly Ala Pro Thr lie 
35 40 45 

Thr Asn Asp Gly Val Ser lie Ala Lys Glu lie Glu Leu Glu Asp Pro 
50 55 60 

Tyr Glu Lys lie Gly Ala Glu Leu Val Lys Glu Val Ala Lys Lys Thr 
65 70 75 80 

Asp Asp Val Ala Gly Asp Gly Thr Thr Thr Ala Thr Val Leu Ala Gin 

85 90 95 



Ala Leu Val Arg Glu Gly Leu Arg Asn Val Ala Ala Gly Ala Asn Pro 
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100 105 110 

Leu Gly Leu Lys Arg Gly lie Glu Lys Ala Val Glu Lys Val Thr Glu 
115 120 125 

Thr Leu Leu Lys Gly Ala Lys Glu Val Glu Thr Lys Glu Gin lie Ala 
130 , 135 140 

Ala Thr Ala Ala lie Ser Ala Gly Asp Gin Ser lie Gly Asp Leu lie 
145 150 155 160 

Ala Glu Ala Met Asp Lys Val Gly Asn Glu Gly Val lie Thr Val Glu 
165 170 175 

Glu Ser Asn Thr Phe Gly Leu Gin Leu Glu Leu Thr Glu Gly Met Arg 
180 185 190 

Phe Asp Lys Gly Tyr lie Ser Gly Tyr Phe Val Thr Asp Pro Glu Arg 
195 200 205 

Gin Glu Ala Val Leu Glu Asp Pro Tyr lie Leu Leu Val Ser Ser Lys 
210 215 220 

Val Ser Thr Val Lys Asp Leu Leu Pro Leu Leu Glu Lys Val lie Gly 
225 230 235 240 

Ala Gly Lys Pro Leu Leu lie lie Ala Glu Asp Val Glu Gly Glu Ala 
245 250 255 

Leu Ser Thr Leu Val Val Asn Lys lie Arg Gly Thr Phe Lys Ser Val 
260 265 270 

Ala Val Lys Ala Pro Gly Phe Gly Asp Arg Arg Lys Ala Met Leu Gin 
275 280 285 

Asp Met Ala lie Leu Thr Gly Gly Gin Val lie Ser Glu Glu Val Gly 
290 295 300 

Leu Thr Leu Glu Asn Ala Asp Leu Ser Leu Leu Gly Lys Ala Arg Lys 
305 310 315 320 

Val Val Val Thr Lys Asp Glu Thr Thr lie Val Glu Gly Ala Gly Asp 
325 330 335 

Thr Asp Ala lie Ala Gly Arg Val Ala Gin lie Arg Gin Glu lie Glu 
340 345 350 

Asn Ser Asp Ser Asp Tyr Asp Arg Glu Lys Leu Gin Glu Arg Leu Ala 
355 360 365 

Lys Leu Ala Gly Gly Val Ala Val lie Lys Ala Gly Ala Ala Thr Glu 
370 375 380 

Val Glu Leu Lys Glu Arg Lys His Arg lie Glu Asp Ala Val Arg Asn 
385 390 395 400 
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Ala Lys Ala Ala 



Leu Leu Gin Ala 
420 

Glu Ala Thr Gly 
435 

Lys Gin lie Ala 
450 

Lys Val Arg Asn 
465 

Val Tyr Glu Asp 



Thr Arg Ser Ala 
500 

Thr Thr Glu Ala 
515 

Val Pro Gly Gly 
530 



Val Glu Glu Gly 
405 

Ala Pro Thr Leu 



Ala Asn lie Val 
440 

Phe Asn Ser Gly 
455 

Leu Pro Ala Gly 
470 

Leu Leu Ala Ala 
485 

Leu Gin Asn Ala 



Val Val Ala Asp 
520 

Gly Asp Met Gly 
535 



He Val Ala Gly 
410 

Asp Glu Leu Lys 
425 

Lys Val Ala Leu 



Leu Glu Pro Gly 
460 

His Gly Leu Asn 
475 

Gly Val Ala Asp 
490 

Ala Ser He Alai 
505 

Lys Pro Glu Lys 



Gly Met Asp Phe 
540 



Gly Gly Val Thr 
415 

Leu Glu Gly Asp 
430 

Glu Ala Pro Leu 
445 

Val Val Ala Glu 



Ala Gin Thr Gly 
480 

Pro Val Lys Val 
495 

Gly Leu Phe Leu 
510 

Glu Lys Ala Ser 
525 



{2) INFORMATION FOR SEQ ID NO: 16: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 537 amino acids 

(B) TYPE: amino acid 

( C ) STRANDEDNESS : 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

Met Ser Lys Leu He Glu Tyr Asp Glu Thr Ala Arg His Ala Met Glu 
15 10 15 

Val Gly Met Asn Lys Leu Ala Asp Thr Val Arg Val Thr Leu Gly Pro 
20 25 30 

Arg Gly Arg His Val Val Leu Ala Lys Ala Phe Gly Gly Pro Thr He 
35 40 45 

Thr Asn Asp Gly Val Thr Val Ala Arg Glu He Asp Leu Glu Asp Pro 
50 55 60 

Phe Glu Asn Leu Gly Ala Gin Leu Val Lys Ser Val Ala Thr Lys Thr 
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65 70 75 80 

Asn Asp Val Ala Gly Asp Gly Thr Thr Thr Ala Thr Val Leu Ala Gin 
85 90 95 

Ala Leu Val Lys Gly Gly Leu Arg Met Val Ala Ala Gly Ala Asn Pro 
100 105 110 

Val Ala Leu Gly Ala Gly lie Ser Lys Ala Ala Asp Ala Val Ser Glu 
115 120 125 

Ala Leu Leu Ala Val Ala Thr Pro Val Ala Gly Lys Asp Ala lie Thr 
130 135 140 

Gin Val Ala Thr Val Ser Ser Arg Asp Glu Gin lie Gly Ala Leu Val 
145 150 155 160 

Gly Glu Gly Met Asn Lys Val Gly Thr Asp Gly Val Val Ser Val Glu 
165 170 175 

Glu Ser Ser Thr Leu Asp Thr Glu Leu Glu Phe Thr Glu Gly Val Gly 
180 185 190 

Phe Asp Lys Gly Phe Leu Ser Ala Tyr Phe Val Thr Asp Phe Asp Ser 
195 200 205 

Gin Gin Ala Val Leu Asp Asp Pro Leu Val Leu Leu His Gin Glu Lys 
210 215 220 

lie Ser Ser Leu Pro Glu Leu Leu Pro Met Leu Glu Lys Val Thr Glu 
225 230 235 240 

Ser Gly Lys Pro Leu Leu lie Val Ala Glu Asp Leu Glu Gly Glu Ala 
245 250 255 

Leu Ala Thr Leu Val Val Asn Ser He Arg Lys Thr Leu Lys Ala Val 
260 265 270 

Ala Val Lys Ser Pro Phe Phe Gly Asp Arg Arg Lys Ala Phe Leu Glu 
275 280 285 

Asp Leu Ala He Val Thr Gly Gly Gin Val Val Asn Pro Glu Thr Gly 
290 295 300 

Leu Val Leu Arg Glu Val Gly Thr Asp Val Leu Gly Ser Ala Arg Arg 
305 310 315 320 

Val Val Val Ser Lys Asp Asp Thr He He Val Asp Gly Gly Gly Ser 
325 330 335 

Asn Asp Ala Val Ala Lys Arg Val Asn Gin Leu Arg Ala Glu He Glu 
340 345 350 



Val Ser Asp Ser Glu Trp Asp Arg Glu Lys Leu Gin Glu Arg Val Ala 
355 360 365 
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Lys Leu Ala Gly Gly Val Ala Val lie Lys Val Gly Ala Val Thr Glu 
370 375 380 

Thr Ala Leu Lys Lys Arg Lys Glu Ser Val Glu Asp Ala Val Ala Ala 
385 390 395 400 

Ala Lys Ala Ser lie Glu Glu Gly lie lie Ala Gly Gly Gly Ser Ala 
405 410 415 

Leu Val Gin Cys Gly Ala Ala Leu Lys Gin Leu Arg Thr Ser Leu Thr 
420 425 430 

Gly Asp Glu Ala Leu Gly lie Asp Val Phe Phe Glu Ala Leu Lys Ala 
435 440 445 

Pro Leu Tyr Trp lie Ala Thr Asn Ala Gly Leu Asp Gly Ala Val Val 
450 455 460 

Val Asp Lys Val Ser Gly Leu Pro Ala Gly His Gly Leu Asn Ala Ser 
465 470 475 480 

Thr Leu Gly Tyr Gly Asp Leu Val Ala Asp Gly Val Val Asp Pro Val 
485 490 495 

Lys Val Thr Arg Ser Ala Val Leu Asn Ala Ala Ser Val Ala Arg Met 
500 505 510 

Met Leu Thr Thr Glu Thr Ala Val Val Asp Lys Pro Ala Lys Thr Glu 
515 520 525 

Glu His Asp His His Gly His Ala His 
530 . 535 

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 541 amino acids 

(B) TYPE: amino acid 

( C ) STRANDEDNESS : 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17 : 

Met Ala Lys Thr lie Ala Tyr Asp Glu Glu Ala Arg Arg Gly Leu Glu 
15 10 15 

Arg Gly Leu Asn Ser Leu Ala Asp Ala Val Lys Val Thr Leu Gly Pro 
20 25 30 

Lys Gly Arg Asn Val Val Leu Glu Lys Lys Trp Gly Ala Pro Thr lie 
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35 40 45 

Thr Asn Asp Gly Val Ser lie Ala Lys Glu lie Glu Leu Glu Asp Pro 
50 55 60 

Tyr Glu Lys lie Gly Ala Glu Leu Val Lys Glu Val Ala Lys Lys Thr 
65 70 75 80 

Asp Asp Val Ala Gly Asp Gly Thr Thr Thr Ala Thr Val Leu Ala Gin 
85 90 95 

Ala Leu Val Lys Glu Gly Leu Arg Asn Val Ala Ala Gly Ala Asn Pro 
100 105 110 

Leu Gly Leu Lys Arg Gly lie Glu Lys Ala Val Asp Lys Val Thr Glu 
115 120 125 

Thr Leu Leu Lys Asp Ala Lys Glu Val Glu Thr Lys Glu Gin lie Ala 
130 135 140 

Ala Thr Ala Ala lie Ser Ala Gly Asp Gin Ser lie Gly Asp Leu lie 
145 150 155 160 

Ala Glu Ala Met Asp Lys Val Gly Met Glu Gly Val lie Thr Val Glu 
165 170 175 

Glu Ser Asn Thr Phe Gly Leu Gin Leu Glu Leu Thr Glu Gly Met Arg 
180 185 190 

Phe Asp Lys Gly Tyr lie Ser Gly Tyr Phe Val Thr Asp Ala Glu Arg 
195 200 205 

Gin Glu Ala Val Leu Glu Glu Pro Tyr lie Leu Leu Val Ser Ser Lys 
210 215 220 

Val Ser Thr Val Lys Asp Leu Leu Pro Leu Leu Glu Lys Val lie Gin 
225 230 235 240 

Ala Gly Lys Ser Leu Leu lie lie Ala Glu Asp Val Glu Gly Glu Ala 
245 250 255 

Leu Ser Thr Leu Val Val Asn Lys lie Arg Gly Thr Phe Lys Ser Val 
260 265 270 

Ala Val Lys Ala Pro Gly Phe Gly Asp Arg Arg Lys Ala Met Leu Gin 
275 280 285 

Asp Met Ala lie Leu Thr Gly Ala Gin Val lie Ser Glu Glu Val Gly 
290 295 300 

Leu Thr Leu Glu Asn Thr Asp Leu Ser Leu Leu Gly Lys Ala Arg Lys 
305 310 315 320 



Val Val Met Thr Lys Asp Glu Thr Thr lie Val Glu Gly Ala Gly Asp 
325 330 335 
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Thr Asp Ala lie Ala Gly Arg Val Ala Gin He Arg Thr Glu He Glu 
340 345 350 

Asn Ser Asp Ser Asp Tyr Asp Arg Glu Lys Leu Gin Glu Arg Leu Ala 
355 360 365 

Lys Leu Ala Gly Gly Val Ala Val He Lys Ala Gly Ala Ala Thr Glu 
370 375 380 

Val Glu Leu Lys Glu Arg Lys His Arg He Glu Asp Ala Val Arg Asn 
385 390 395 400 

Ala Lys Ala Ala Val Glu Glu Gly He Val Ala Gly Gly Gly Val Thr 
405 410 415 

Leu Leu Gin Ala Ala Pro Ala Leu Asp Lys Leu Lys Leu Thr Gly Asp 
420 425 430 

Glu Ala Thr Gly Ala Asn He Val Lys Val Ala Leu Glu Ala Pro Leu 
435 440 445 

Lys Gin He Ala Phe Asn Ser Gly Met Glu Pro Gly Val Val Ala Glu 
450 455 460 

Lys Val Arg Asn Leu Ser Val Gly His Gly Leu Asn Ala Ala Thr Gly 
465 470 475 480 

Glu Tyr Glu Asp Leu Leu Lys Ala Gly Val Ala Asp Pro Val Lys Val 
485 490 495 

Thr Arg Ser Ala Leu Gin Asn Ala Ala Ser He Ala Gly Leu Phe Leu 
500 505 510 

Thr Thr Glu Ala Val Val Ala Asp Lys Pro Glu Lys Thr Ala Ala Pro 
515 520 525 

Ala Ser Asp Pro Thr Gly Gly Met Gly Gly Met Asp Phe 
530 535 540 

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 539 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

Met Ser Lys Leu He Glu Tyr Asp Glu Thr Ala Arg Arg Ala Met Glu 
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1 5 10 15 

Val Gly Met Asp Lys Leu Ala Asp Thr Val Arg Val Thr Leu Gly Pro 
20 25 30 

Arg Gly Arg His Val Val Leu Ala Lys Ala Phe Gly Gly Pro Thr Val 
35 40 45 

Thr Asn Asp Gly Val Thr Val Ala Arg Glu lie Glu Leu Glu Asp Pro 
50 55 60 

Phe Glu Asp Leu Gly Ala Gin Leu Val Lys Ser Val Ala Thr Lys Thr 
65 70 75 80 

Asn Asp Val Ala Gly Asp Gly Thr Thr Thr Ala Thr lie Leu Ala Gin 
85 90 95 

Ala Leu lie Lys Gly Gly Leu Arg Leu Val Ala Ala Gly Val Asn Pro 
100 105 110 

lie Ala Leu Gly Val Gly lie Gly Lys Ala Ala Asp Ala Val Ser Glu 
115 120 125 

Ala Leu Leu Ala Ser Ala Thr Pro Val Ser Gly Lys Thr Gly lie Ala 
130 135 140 

Gin Val Ala Thr Val Ser Ser Arg Asp Glu Gin lie Gly Asp Leu Val 
145 150 155 160 

Gly Glu Ala Met Ser Lys Val Gly His Asp Gly Val Val Ser Val Glu 
165 170 175 

Glu Ser Ser Thr Leu Gly Thr Glu Leu Glu Phe *Thr Glu Gly lie Gly 
180 185 190 

Phe Asp Lys Gly Phe Leu Ser Ala Tyr Phe Val Thr Asp Phe Asp Asn 
195 200 205 

Gin Gin Ala Val Leu Glu Asp Ala Leu lie Leu Leu His Gin Asp Lys 
210 215 220 

lie Ser Ser Leu Pro Asp Leu Leu Pro Leu Leu Glu Lys Val Ala Gly 
225 230 235 240 

Thr Gly Lys Pro Leu Leu lie Val Ala Glu Asp Val Glu Gly Glu Ala 
245 250 255 

Leu Ala Thr Leu Val Val Asn Ala lie Arg Lys Thr Leu Lys Ala Val 
260 265 270 

Ala Val Lys Gly Pro Tyr Phe Gly Asp Arg Arg Lys Ala Phe Leu Glu 
275 280 285 



Asp Leu Ala Val Val Thr Gly Gly Gin Val Val 
290 295 



Asn Pro Asp Ala Gly 
300 



WO 99/35270 



42 



PCT/CA98/01203 



Met Val Leu Arg Glu Val Gly Leu Glu Val Leu Gly Ser Ala Arg Arg 
305 310 315 320 

Val Val Val Ser Lys Asp Asp Thr Val lie Val Asp Gly Gly Gly Thr 
325 330 335 

Ala Glu Ala Val Ala Asn Arg Ala Lys His Leu Arg Ala Glu lie Asp 
340 345 350 

Lys Ser Asp Ser Asp Trp Asp Arg Glu Lys Leu Gly Glu Arg Leu Ala 
355 360 365 

Lys Leu Ala Gly Gly Val Ala Val He Lys Val Gly Ala Ala Thr Glu 
370 375 380 

Thr Ala Leu Lys Glu Arg Lys Glu Ser Val Glu Asp Ala Val Ala Ala 
385 390 395 400 

Ala Lys Ala Ala Val Glu Glu Gly He Val Pro Gly Gly Gly Ala Ser 
405 410 415 

Leu He His Gin Ala Arg Lys Ala Leu Thr Glu Leu Arg Ala Ser Leu 
420 425 430 

Thr Gly Asp Glu Val Leu Gly Val Asp Val Phe Ser Glu Ala Leu Ala 
435 440 445 

Ala Pro Leu Phe Trp He Ala Ala Asn Ala Gly Leu Asp Gly Ser Val 
450 455 460 

Val Val Lys Lys Val Ser Glu Leu Pro Ala Gly His Gly Leu Asn Val 
465 470 475 480 

Asn Thr Leu Ser Tyr Gly Asp Leu Ala Ala Asp Gly Val He Asp Pro 
485 490 495 

Val Lys Val Thr Arg Ser Ala Val Leu Asn Ala Ser Ser Val Ala Arg 
500 505 510 

Met Val Leu Thr Thr Glu Thr Val Val Val Asp Lys Pro Ala Lys Ala 
515 520 525 

Glu Asp His Asp His His His Gly His Ala His 
530 535 

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 545 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

Met Ala Ala Lys Asp Val Gin Phe Gly Asn Glu Val Arg Gin Lys Met 
15 10 15 

Val Asn Gly Val Asn lie Leu Ala Asn Ala Val Arg Val Thr Leu Gly 
20 25 30 

Pro Lys Gly Arg Asn Val Val Val Asp Arg Ala Phe Gly Gly Pro His 
35 40 45 

lie Thr Lys Asp Gly Val Thr Val Ala Lys Glu lie Glu Leu Lys Asp 
50 55 60 

Lys Phe Glu Asn Met Gly Ala Gin Met Val Lys Glu Val Ala Ser Lys 
65 70 75 80 

Thr Asn Asp Val Ala Gly Asp Gly Thr Thr Thr Ala Thr Val Leu Ala 
85 90 95 

Gin Ser lie Val Ala Glu Gly Met Lys Tyr Val Thr Ala Gly Met Asn 
100 105 110 



Pro Thr Asp Leu Lys Arg Gly lie Asp Lys Ala Val Ala Ala Leu Val 
115 120 125 

Glu Glu Leu Lys Asn lie Ala Lys Pro Cys Asp Thr Ser Lys Glu lie 
130 135 140 



Ala Gin Val Gly 
145 

He He Ala Glu 



Val Glu Asp Gly 
180 

Met Gin Phe Asp 
195 

Glu Lys Gin He 
210 

Lys Lys He Ser 
225 

Ala Lys Ala Ser 



Glu Ala Leu Ala 
260 



Ser He Ser Ala 
150 

Ala Met Glu Lys 
165 

Lys Ser Leu Glu 



Arg Gly Tyr Leu 
200 

Ala Gly Leu Asp 
215 

Asn He Arg Asp 
230 

Arg Pro Leu Leu 
245 

Thr Leu Val Val 



Asn Ser Asp Glu 
155 

Val Gly Lys Glu 
170 

Asn Glu Leu Asp 
185 

Ser Pro Tyr Phe 



Asn Pro Phe Val 
220 

Leu Leu Pro Val 
235 

He He Ala Glu 
250 

Asn Asn He Arg 
265 



Gin Val Gly Ala 
160 

Gly Val He Thr 
175 

Val Val Glu Gly 
190 

He Asn Asp Ala 
205 

Leu Leu Phe Asp 



Leu Glu Gin Val 
240 

Asp Val Glu Gly 
255 

Gly He Leu Lys 
270 
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Thr Val Ala Val Lys Ala Pro Gly Phe Gly Asp Arg Arg Lys Ala Met 
275 280 285 

Leu Gin Asp lie Ala lie Leu Thr Gly Gly Thr Val lie Ser Glu Glu 
290 295 300 

Val Gly Leu Ser Leu Glu Lys Ala Thr Leu Asp Asp Leu Gly Gin Ala 
305 310 315 320 

Lys Arg lie Glu lie Gly Lys Glu Asn Thr Thr lie lie Asp Gly Phe 
325 330 335 

Gly Asp Ala Ala Gin lie Glu Ala Arg Val Ala Glu lie Arg Gin Gin 
340 345 350 

lie Glu Thr Ala Thr Ser Asp Tyr Asp Lys Glu Lys Leu Gin Glu Arg 
355 360 365 

Val Ala Lys Leu Ala Gly Gly Val Ala Val lie Lys Val Gly Ala Ala 
370 375 380 

Thr Glu Val Glu Met Lys Glu Lys Lys Asp Arg Val Glu Asp Ala Leu 
385 390 395 400 

His Ala Thr Arg Ala Ala Val Glu Glu Gly Val Val Ala Gly Gly Gly 
405 410 415 

Val Ala Leu Leu Arg Ala Arg Ala Ala Leu Glu Asn Leu His Thr Gly 
420 425 430 

Asn Ala Asp Gin Asp Ala Gly Val Gin lie Val Leu Arg Ala Val Glu 
435 440 445 

Ser Pro Leu Arg Gin lie Val Ala Asn Ala Gly Gly Glu Pro Ser Val 
450 455 460 

Val Val Asn Lys Val Leu Glu Gly Lys Gly Asn Tyr Gly Tyr Asn Ala 
465 470 475 480 

Gly Ser Gly Glu Tyr Gly Asp Met lie Glu Met Gly Val Leu Asp Pro 
485 490 495 

Ala Lys Val Thr Arg Ser Ala Leu Gin His Ala Ala Ser lie Ala Gly 
500 505 510 

Leu Met Leu Thr Thr Asp Cys Met lie Ala Glu lie Pro Glu Glu Lys 
515 520 525 

Pro Ala Met Pro Asp Met Gly Gly Met Gly Gly Met Gly Gly Met Met 
530 535 540 



Glx 
545 
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(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 539 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIP 

Met Val Lys Gin Leu 
1 5 

Arg Gly Val Asp Gin 
20 

Lys Gly Arg Asn Val 
35 

Thr Asn Asp Gly Val 
50 

Tyr Glu Asn Met Gly 
65 

Asn Glu lie Ala Gly 
85 

Ala Met lie Gin Glu 
100 

Val Gly Leu Arg Gin 
115 

Ala Leu His Glu Asn 
130 

Gin Val Gly Ala lie 
145 

Ser Glu Ala Thr Glu 
165 

Thr lie Glu Glu Ser 
180 

Gin Phe Asp Arg Gly 
195 

Lys Met Val Ala Glu 
210 



TION: SEQ ID NO: 20: 

Lys Phe Ser Glu Asp Ala 
10 

Leu Ala Asn Ala Val Lys 
25 

Val Leu Asp Lys Glu Phe 
40 

Thr lie Ala Lys Glu lie 
55 

Ala Lys Leu Val Gin Glu 
70 75 

Asp Gly Thr Thr Thr Ala 
90 

Gly Leu Lys Asn Val Thr 
105 

Gly lie Asp Lys Ala Val 
120 

Ser Gin Lys Val Glu Asn 
135 

Ser Ala Ala Asp Glu Glu 
150 155 

Lys Val Gly Asn Asp Gly 
170 

Asn Arg Leu Asn Thr Glu 
185 

Tyr Gin Ser Pro Tyr Met 
200 

Leu Glu Arg Pro Tyr lie 
215 



Arg Gin Ala Met Leu 
15 

Val Thr lie Gly Pro 
30 

Thr Ala Pro Leu lie 
45 

Glu Leu Glu Asp Pro 
60 

Val Ala Asn Lys Thr 
80 

Thr Val Leu Ala Gin 
95 

Ser Gly Ala Asn Pro 
110 

Lys Val Ala Val Glu 
125 

Lys Asn Glu lie Ala 
14 0 

lie Gly Arg Tyr lie 
160 

Val lie Thr lie lie 
175 

Leu Glu Leu Gly Met 
190 

Val Thr Asp Ser Asp 
205 

Leu Val Thr Asp Lys 
220 
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Lys lie Ser Ser Phe Gin Asp lie Leu Pro Leu Leu Glu Gin Val Val 
225 230 235 240 

Gin Ser Asn Arg Pro lie Leu lie Val Ala Asp Glu Val Glu Gly Asp 
245 250 255 

Ala Leu Thr Asn lie Val Leu Asn Arg Met Arg Gly Thr Phe Thr Ala 
260 265 270 

Val Ala Val Lys Ala Pro Gly Phe Gly Asp Arg Arg Lys Ala Met Leu 
275 280 285 

Glu Asp Leu Ala lie Leu Thr Gly Ala Gin Val lie Thr Asp Asp Leu 
290 295 300 

Gly Leu Asp Leu Lys Asp Ala Ser lie Asp Met Leu Gly Thr Ala Ser 
305 310 315 320 

Lys Val Glu Val Thr Lys Asp Asn Thr Thr Val Val Asp Gly Asp Gly 
325 330 335 

Asp Glu Asn Ser lie Asp Ala Arg Val Ser Gin Leu Lys Ser Gin lie 
340 345 350 

Glu Glu Thr Glu Ser Asp Phe Asp Arg Glu Lys Leu Gin Glu Arg Leu 
355 360 365 

Ala Lys Leu Ala Gly Gly Val Ala Val lie Lys Val Gly Ala Ala Ser 
370 375 380 

Glu Thr Glu Leu Lys Glu Arg Lys Leu Arg lie Glu Asp Ala Leu Asn 
385 390 395 400 

Ser Thr Arg Ala Ala Val Glu Glu Gly lie Val Ala Gly Gly Gly Thr 
405 410 415 

Ala Leu Val Asn Val Tyr Gin Lys Val Ser Glu Asn Glu Ala Glu Gly 
420 425 430 

Asp lie Glu Thr Gly Val Asn lie Val Leu Lys Ala Leu Thr Ala Pro 
435 440 445 

Val Arg Gin lie Ala Glu Asn Ala Gly Leu Glu Gly Ser Val He Val 
450 455 460 

Glu Arg Leu Lys Asn Ala Glu Pro Gly Val Gly Phe Asn Gly Ala Thr 
465 470 475 480 

Asn Glu Trp Val Asn Met Leu Arg Arg Gly He Val Asp Pro Thr Lys 
485 490 495 



Val Thr Arg Ser Ala Leu Gin His Ala Ala Ser Val Ala Ala Met Phe 
500 505 510 
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Leu Thr Thr Glu Ala Val Val Ala Ser lie Pro Glu Lys Asn Asn Asp 
515 - 520 525 

Gin Pro Asn Met Gly Gly Met Pro Gly Met Met 
530 535 

(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 541 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:21: 

Met Ala Lys Ser lie lie Tyr Asn Asp Glu Ala Arg Arg Ala Leu Glu 
1 5 10 15 

Arg Gly Met Asp lie Leu Ala Glu Ala Val Ala Val Thr Leu Gly Pro 
20 25 30 

Lys Gly Arg Asn Val Val Leu Glu Lys Lys Phe Gly Ser Pro Gin lie 
35 40 45 

lie Asn Asp Gly He Thr He Ala Lys Glu He Glu Leu Glu Asp His 
50 55 60 

Val Glu Asn Thr Gly Val Ser Leu He Arg Gin Ala Ala Ser Lys Thr 
65 70 75 80 

Asn Asp Val Ala Gly Asp Gly Thr Thr Thr Ala Thr Val Leu Ala His 
85 90 95 

Ala He Val Lys Glu Gly Leu Arg Asn Val Ala Ala Gly Ala Asn Pro 
100 105 110 

He Ser Leu Lys Arg Gly He Asp Lys Ala Thr Asp Phe Leu Val Ala 
115 120 125 

Arg He Lys Glu His Ala Gin Pro Val Gly Asp Ser Lys Ala He Ala 
130 135 140 

Gin Val Gly Ala He Ser Ala Gly Asn Asp Glu Glu Val Gly Gin Met 
145 150 155 160 

He Ala Asn Ala Met Asp Lys Val Gly Gin Glu Gly Val He Ser Leu 
165 170 175 



Glu Glu Gly Lys Ser Met Thr Thr Glu Leu Glu He Thr Glu Gly Met 
180 185 190 
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Arg Phe Asp Lys Gly Tyr lie Ser Pro Tyr Phe Val Thr Asp Ala Glu 
195 200 205 

Arg Met Glu Ala Val Leu Glu Asp Pro Arg lie Leu He Thr Asp Lys 
210 215 220 

Lys He Asn Leu Val Gin Asp Leu Val Pro He Leu Glu Gin Val Ala 
225 230 235 240 

Arg Gin Gly Lys Pro Leu Leu He He Ala Glu Asp He Glu Lys Glu 
245 250 255 

Ala Leu Ala Thr Leu Val Val Asn Arg Leu Arg Gly Val Leu Asn Val 
260 265 270 

Ala Ala Val Lys Ala Pro Gly Phe Gly Asp Arg Arg Lys Gin Met Leu 
275 . 280 285 

Glu Asp He Ala Thr Leu Thr Gly Gly Gin Val He Ser Glu Asp Ala 
290 295 300 

Gly Leu Lys Leu Glu Ser Ala Thr Val Asp Ser Leu Gly Ser Ala Arg 
305 310 315 320 

Arg He Asn He Thr Lys Asp Asn Thr Thr He Val Ala Glu Gly Asn 
325 330 335 

Glu Ala Ala Val Lys Ser Arg Cys Glu Gin He Arg Arg Gin He Glu 
340 345 350 

Glu Thr Asp Ser Ser Tyr Asp Lys Glu Lys Leu Gin Glu Arg Leu Ala 
355 360 365 

Lys Leu Ala Gly Gly Val Ala Val He Lys Val Gly Ala Ala Thr Glu 
370 375 380 

Thr Glu Met Lys Asp Arg Lys Leu Arg Leu Glu Asp Ala He Asn Ala 
385 390 395 400 

Thr Lys Ala Ala Val Glu Glu Gly He Val Pro Gly Gly Gly Thr Thr 
405 410 415 



Leu Ala His Leu Ala Pro Gin Leu Glu Asp Trp Ala Thr Gly Asn Leu 
420 425 430 

Lys Asp Glu Glu Leu Thr Gly Ala Leu He Val Ala Arg Ala Leu Pro 
435 440 445 

Ala Pro Leu Lys Arg He Ala Glu Asn Ala Gly Gin Asn Gly Ala Val 
450 455 460 



He Ser Glu Arg Val Lys Glu Lys Glu Phe Asn Val Gly Tyr Asn Ala 
465 470 475 480 
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Ala Ser Leu Glu 



Ala Lys Val Thr 
500 

Met Val Leu Thr 
515 

Lys Ala Pro Ala 
530 



Tyr Val Asp Met 
485 

Arg Ser Ala Leu 



Thr Glu Cys lie 
520 

Gly Ala Pro Gly 
535 



Leu Ala Ala Gly 
490 

Gin Asn Ala Ala 
505 

Val Val Asp Lys 



Gly Asp Phe Asp 
540 



lie Val Asp Pro 
495 

Ser lie Ala Gly 
510 

Pro Glu Lys Glu 
525 

Tyr 



(2) INFORMATION FOR SEQ ID NO:22: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 552 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO:22: 

Met Ser Lys Leu lie Ser Phe Lys Asp Glu Ser Arg Arg Ser Leu Glu 
15 10 15 

Ala Gly lie Asn Ala Leu Ala Asp Ala Val Arg lie Thr Leu Gly Pro 
20 25 30 

Lys Gly Arg Asn Val Leu Leu Glu Lys Gin Tyr Gly Ala Pro Gin lie 
35 40 45 

Val Asn Asp Gly lie Thr Val Ala Lys Glu lie Glu Leu Ser Asn Pro 
50 55 60 

Glu Glu Asn Ala Gly Ala Lys Leu lie Gin Glu Val Ala Ser Lys Thr 
65 70 75 80 

Lys Glu lie Ala Gly Asp Gly Thr Thr Thr Ala Thr lie lie Ala Gin 
85 90 95 

Ala Leu Val Arg Glu Gly Leu Arg Asn Val Ala Ala Gly Ala Asn Pro 
100 105 110 

Val Ala Leu Arg Arg Gly lie Glu Lys Val Thr Thr Phe Leu Val Gin 
115 120 125 

Glu lie Glu Ala Val Ala Lys Pro Val Glu Gly Ser Ala lie Ala Gin 
130 135 140 



Val Ala, Thr Val Ser Ser Gly Asn Asp Pro Glu Val Gly Ala Met lie 
145 150 155 160 
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Ala Asp Ala Met Asp Lys Val Thr Lys Asp Gly Val lie Thr Val Glu 
165 170 175 

Glu Ser Lys Ser Leu Asn Thr Glu Leu Glu Val Val Glu Gly Met Gin 
180 185 190 

lie Asp Arg Gly Tyr lie Ser Pro Tyr Phe lie Thr Asp Ser Asp Arg 
195 200 205 

Gin Leu Val Glu Phe Asp Asn Pro Leu lie Leu lie Thr Asp Lys Lys 
210 215 220 

lie Ser Ala lie Ala Glu Leu Val Pro Val Leu Glu Ala Val Ala Arg 
225 230 235 240 

Ala Gly Arg Pro Leu Leu lie lie Ala Glu Asp lie Glu Gly Glu Ala 
245 250 255 

Leu Ala Thr Leu Val Val Asn Lys Ala Arg Gly Val Leu Asn Val Ala 
260 265 270 

Ala lie Lys Ala Pro Ala Phe Gly Asp Arg Arg Lys Ala Val Leu Gin 
275 280 285 

Asp lie Ala lie Leu Thr Gly Gly Ser Val lie Ser Glu Asp lie Gly 
290 295 300 

Leu Ser Leu Asp Thr Val Ser Leu Asp Gin Leu Gly Gin Ala Val Lys 
305 310 315 320 

Ala Thr Leu Glu Lys Asp Asn Thr lie Leu Val Ala Gly Ala Asp Lys 
325 330 335 

Arg Ala Ser Ala Gly Val Lys Glu Arg lie Glu Gin Leu Arg Lys Glu 
340 345 350 

Tyr Ala Ala Ser Asp Ser Asp Tyr Asp Lys Glu Lys lie Gin Glu Arg 
355 360 365 

lie Ala Lys Leu Ala Gly Gly Val Ala Val lie Lys Val Gly Ala Ala 
370 375 380 

Thr Glu Thr Glu Leu Lys Asp Arg Lys Leu Arg lie Glu Asp Ala Leu 
385 390 395. 400 

Asn Ala Thr Lys Ala Ala Val Glu Glu Gly lie Val Pro Gly Gly Gly 
405 410 415 

Thr Thr Leu lie Arg Leu Ala Gly Lys lie Glu Ser Phe Lys Ala Gin 
420 425 430 

Leu Ser Asn Asp Glu Glu Arg Val Ala Ala Asp lie lie Ala Lys Ala 
435 440 445 
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Leu Glu Ala Pro Leu His Gin Leu Ala Ser Asn Ala Gly Val Glu Gly 
450 455 460 

Ser Val He Val Glu Lys Val Lys Glu Ala Thr Gly Asn Gin Gly Tyr 
465 470 475 480 

Asn Val He Thr Gly Lys He Glu Asp Leu He Ala Ala Gly He He 
485 490 495 

Asp Pro Ala Lys Val Val Arg Ser Ala Leu Gin Asn Ala Ala Ser He 
500 505 510 

Ala Gly Met Val Leu Thr Thr Glu Ala Leu Val Val Glu Lys Pro Glu 
515 520 525 

Pro Ala Ala Pro Ala Met Pro Asp Met Gly Gly Met Gly Gly Met Gly 
530 535 540 

Gly Met Gly Gly Met Gly Met Met 
545 550 

(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 53 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 

Met Ala Lys Thr He Ala Phe Asp Lys Lys Ala Arg Arg Gly Leu Glu 
15 10 15 

Arg Gly Leu Asn Ala Leu Ala Asp Ala Val Lys Val Thr Leu Gly Pro 
20 25 30 

Lys Gly Arg Asn Val Val Leu Glu Lys Lys Trp Gly Ala Pro Thr He 
35 40 45 

Thr Asn Asp Gly Val Ser He Ala Lys Glu He Glu Leu Glu Asp Pro 
50 55 60 

Tyr Glu Lys He Gly Ala Glu Leu Val Lys Glu Val Ala Lys Lys Thr 
65 70 75 80 

Asp Asp Val Ala Gly Asp Gly Thr Thr Thr Ala Thr Val Leu Ala Gin 
85 90 95 



Ala Leu Val 



Arg Glu Gly Leu Arg Asn Val Ala Ala Gly Ala Asn Pro 
100. 105 110 
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Leu Gly Leu Lys Arg Gly He Glu Lys Ala Val Glu Ala Val Thr Glu 
115 120 125 

His Leu Leu Lys Ala Ala Lys Glu Val Glu Thr Lys Asp Gin He Ala 
130 135 140 

Ala Thr Ala Gly He Ser Ala Gly Asp Pro Ala He Gly Glu Leu He 
145 150 155 160 

Ala Glu Ala Met Asp Lys Val Gly Lys Glu Gly Val He Thr Val Glu 
165 170 175 

Glu Ser Asn Thr Phe Gly Leu Gin Leu Glu Leu Thr Glu Gly Met Arg 
180 185 190 

Phe Asp Lys Gly Phe He Ser Gly Tyr Phe Ala Thr Asp Ala Glu Arg 
195 200 205 

Gin Glu Ala Val Leu Glu Asp Pro Tyr Val Leu Leu Val Ser Gly Lys 
210 215 220 

He Ser Thr Val Lys Asp Leu Leu Pro Leu Leu Glu Lys Val He Gin 
225 230 235 240 

Ser Gly Lys Pro Leu Ala He He Ala Glu Asp Val Glu Gly Glu Ala 
245 250 255 

Leu Val Thr Leu He Val Asn Lys He Arg Gly Thr Phe Lys Ser Val 
260 265 270 

Ala He Lys Ala Pro Gly Phe Gly Asp Arg Arg Lys Ala Met Leu Gin 
275 280 285 

Asp Met Ala He Leu Thr Gly Gly Gin Val He Ser Glu Glu He Gly 
290 295 300 

Leu Ser Leu Asp Thr Ala Gly Leu Glu Val Leu Gly Gin Ala Arg Gin 
305 310 315 320 

Val Val Val Thr Lys Asp Glu Thr Thr He Val Asp Gly Ala Gly Ser 
325 330 335 

Lys Glu Gin He Ala Gly Arg Val Ser Gin He Arg Ala Glu He Glu 
340 345 350 



Asn Ser Asp Ser Asp Tyr Asp Arg Glu Lys Leu Gin Glu Arg Leu Ala 
355 360 365 

Lys Leu Ala Gly Gly Val Ala Val He Lys Ala Gly Ala Ala Thr Glu 
370 375 380 

Asp Leu Lys Glu Arg Lys His Arg He Glu Asp Ala Val Arg Asn Ala 
385 390 395 400 
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Lys Ala Ala Val Glu Glu Gly lie Val Ala Gly Gly Gly Ser Ser Leu 
405 410 415 

Ala Gin Ser Gly Thr Val Phe Asp Ser Xaa Ala Leu Glu Gly Asp Glu 
420 425 430 

Ala Thr Gly Ala Asn lie Val Lys Val Ala Leu Asp Ala Pro Val Lys 
435 440 445 

Gin lie Ala Val Asn . Ala Gly Leu Glu Pro Gly Val Val Ala Glu Lys 
450 455 460 

Val Arg Asn Ser Pro Ala Gly Thr Gly Leu Asn Ala Ala Thr Gly Val 
465 470 475 480 

Tyr Glu Asp Leu Leu Ala Ala Gly lie Asn Asp Pro Val Lys Val Thr 
485 490 495 

Arg Ser Ala Leu Gin Asn Ala Ala Ser lie Ala Ala Leu Phe Leu Thr 
500 505 510 

Thr Glu Ala Val Val Ala Asp Lys Pro Glu Lys Ala Gly Ala Pro Val 
515 520 525 

Asp Pro Thr Gly Gly Met Gly Gly Met Asp Phe 
530 535 

(2) INFORMATION FOR SEQ ID NO:24: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 582 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:24: 

Met Val Ser Phe Leu Ser Ser Ser Val Ser Arg Leu Pro Leu Arg lie 
1 5 10 15 

Ala Gly Arg Arg lie Pro Gly Arg Phe Ala Val Pro Gin Val Arg Thr 
20 25 30 

Tyr Ala Lys Asp Leu Lys Phe Gly Val Asp Ala Arg Ala Ser Leu Leu 
35 40 45 

Thr Gly Val Asp Thr Leu Ala Arg Ala Val Ser Val Thr Leu Gly Pro 
50 55 60 

Lys Gly Arg Asn Val Leu lie Asp Gin Pro Phe Gly Ser Pro Lys lie 
65 70 75 80 
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Thr Lys Asp Gly Val Thr Val Ala Arg Ser Val Ser Leu Lys Asp Lys 
85 90 95 

Phe Glu Asn Leu Gly Ala Arg Leu Val Gin Asp Val Ala Ser Lys Thr 
100 105 110 

Asn Glu Val Ala Gly Asp Gly Thr Thr Thr Ala Thr Val Leu Thr Arg 
115 120 125 

Ala lie Phe Ser Glu Thr Val Arg Asn Val Ala Ala Gly Cys Asn Pro 
130 135 140 

Met Asp Leu Arg Arg Gly lie Gin Leu Ala Val Asp Asn Val Val Glu 
145 150 155 160 

Phe Leu Gin Ala Asn Lys Arg Asp He Thr Thr Ser Glu Glu He Ser 
165 170 175 

Gin Val Ala Thr He Ser Ala Asn Gly Asp Thr His He Gly Glu Leu 
180 185 190 

Leu Ala Lys Ala Met Glu Arg Val Gly Lys Glu Gly Val He Thr Val 
195 200 205 

Lys Glu Gly Arg Thr He Ser Asp Glu Leu Glu Val Thr Glu Gly Met 
210 215 220 

Lys Phe Asp Arg Gly Tyr He Ser Pro Tyr Phe He Thr Asp Val Lys 
225 230 235 240 

Ser Gin Lys Val Glu Phe Glu Asn Pro Leu He Leu Leu Ser Glu Lys 
245 250 255 

Lys Val Ser Ala Val Gin Asp He Leu Pro Ser Leu Glu Leu Ala Ala 
260 265 270 

Gin Gin Arg Arg Pro Leu Val He He Ala Glu Asp Val Asp Gly Glu 
275 280 285 

Ala Leu Ala Ala Cys He Leu Asn Lys Leu Arg Gly Gin Leu Gin Val 
290 295 300 

Val Ala He Lys Ala Pro ,Gly Phe Gly Asp Asn Arg Arg Asn Met Leu 
305 310 315 320 

Gly Asp Leu Ala Val Leu Thr Asp Ser Ala Val Phe Asn Asp Glu He 
325 330 335 

Asp Val Ser He Glu Lys Ala Gin Pro His His Leu Gly Ser Cys Gly 
340 345 350 

Ser Val Thr Val Thr Lys Glu Asp Thr He He Met Lys Gly Ala Gly 
355 360 365 
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Asp His Val Lys Val Asn Asp Arg Cys Glu Gin lie Arg Gly Val Met 
370 375 380 

Ala Asp Pro Asn Leu Thr Glu Ser Glu Lys Glu Lys Leu Gin Glu Arg 
385 390 395 400 

Leu Ala Lys Leu Ser Gly Gly lie Ala Val lie Lys Val Gly Ala Ser 
405 410 415 

Ser Glu Val Glu Val Asn Glu Lys Lys Asp Arg lie Val Asp Ala Leu 
420 425 430 

Asn Ala Val Lys Ala Ala Val Ser Glu Gly Val Leu Pro Gly Ala Gly 
435 440 445 

Thr Ser Phe Val Lys Ala Ser Leu Arg Leu Gly Asp lie Pro Thr Asn 
450 455 460 

Asn Phe Asp Gin Lys Leu Gly Val Glu lie Val Arg Lys Ala lie Thr 
465 470 475 480 

Arg Pro Ala Gin Thr lie Leu Glu Asn Ala Gly Leu Glu Gly Asn Leu 
485 490 495 

lie Val Gly Lys Leu Lys Glu Leu Tyr Gly Lys Glu Phe Asn lie Gly 
500 505 510 

Tyr Asp lie Ala Lys Asp Arg Phe Val Asp Leu Asn Glu lie Gly Val 
515 520 525 

Leu Asp Pro Leu Lys Val Val Arg Thr Gly Leu Val Asp Ala Ser Gly 
530 535 540 

Val Ala Ser Leu Met Gly Thr Thr Glu Cys Ala lie Val Asp Ala Pro 
545 550 555 560 

Glu Glu Ser Lys Ala Pro Ala Gly Pro Pro Gly Met Gly Gly Met Gly 
565 570 575 

Gly Met Pro Gly Met Met 
580. 

(2) INFORMATION FOR SEQ ID NO:25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 572 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

( D ) TOPOLOGY : linear 



(xi) 



SEQUENCE DESCRIPTION: SEQ ID NO: 25: 
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Met Leu Arg Ser Ser Val Val Arg Ser Arg Ala Thr Leu Arg Pro Leu 
15 10 15 

Leu Arg Arg Ala Tyr Ser Ser His Lys Glu Leu Lys Phe Gly Val Glu 
20 25 30 

Gly Arg Ala Ser Leu Leu Lys Gly Val Glu Thr Leu Ala Glu Ala Val 
35 40 45 

Ala Ala Thr Leu Gly Pro Lys Gly Arg Asn Val Leu lie Glu Gin Pro 
50 55 60 

Phe Gly Pro Pro Lys lie Thr Lys Asp Gly Val Thr Val Ala Lys Ser 
65 70 75 80 

lie Val Leu Lys Asp Lys Phe Glu Asn Met Gly Ala Lys Leu Leu Gin 
85 90 95 

Glu Val Ala Ser Lys Thr Asn Glu Ala Ala Gly Asp Gly Thr Thr Ser 
100 105 110 

Ala Thr Val Leu Gly Arg Ala lie Phe Thr Glu Ser Val Lys Asn Val 
115 120 125 

Ala Ala Gly Cys Asn Pro Met Asp Leu Arg Arg Gly Ser Gin Val Ala 
130 135 140 

Val Glu Lys Val lie Glu Phe Leu Ser Ala Asn Lys Lys Glu lie Thr 
145 150 155 160 

Thr Ser Glu Glu lie Ala Gin Val Ala Thr lie Ser Ala Asn Gly Asp 
165 170 175 

Ser His Val Gly Lys Leu Leu Ala Ser Ala Met Glu Lys Val Gly Lys 
180 185 190 

Glu Gly Val lie Thr lie Arg Glu Gly Arg Thr Leu Glu Asp Glu Leu 
195 200 205 

Glu Val Thr Glu Gly Met Arg Phe Asp Arg Gly Phe lie Ser Pro Tyr 
210 215 220 

Phe lie Thr Asp Pro Lys Ser Ser Lys Val Glu Phe Glu Lys Pro Leu 
225 230 235 240 

Leu Leu Leu Ser Glu Lys Lys lie Ser Ser lie Gin Asp lie Leu Pro 
245 250 255 

Ala Leu Glu lie Ser Asn Gin Ser Arg Arg Pro Leu Leu lie lie Ala 
260 265 270 



Glu Asp Val Asp Gly Glu Ala Leu Ala Ala Cys lie Leu Asn Lys Leu 
275 280 285 
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Arg Gly Gin Val Lys Val Cys Ala Val Lys Ala Pro Gly Phe Gly Asp 
290 295 300 

Asn Arg Lys Asn Thr lie Gly Asp lie Ala Val Leu Thr Gly Gly Thr 
305 310 315 320 

Val Phe Thr Glu Glu Leu Asp Leu Lys Pro Glu Gin Cys Thr lie Glu 
325 330 335 

Asn Leu Gly Ser Cys Asp Ser lie Thr Val Thr Lys Glu Asp Thr Val 
340 345 350 

lie Leu Asn Gly Ser Gly Pro Lys Glu Ala lie Gin Glu Arg lie Glu 
355 360 365 

Gin lie Lys Gly Ser lie Asp lie Thr Thr Thr Asn Ser Tyr Glu Lys 
' 370 375 380 

Glu Lys Leu Gin Glu Arg Leu Ala Lys Leu Ser Gly Gly Val Ala Val 
. 385 390 395 400 

lie Arg Val Gly Gly Ala Ser Glu Val Glu Val Gly Glu Lys Lys Asp 
405 410 415 

Arg Tyr Asp Asp Ala Leu Asn Ala Thr Arg Ala Ala Val Glu Glu Gly 
420 425 430 

lie Leu Pro Gly Gly Gly Thr Ala Leu Val Lys Ala Ser Arg Val Leu 
435 440 445 

Asp Glu Val Val Val Asp Asn Phe Asp Gin Lys Leu Gly Val Asp lie 
450 455 460 

v 

lie Arg Lys Ala lie Thr Arg Pro Ala Lys Gin lie lie Glu Asn Ala 
465 470 475 480 

Gly Glu Glu Gly Ser Val lie lie Gly Lys Leu lie Asp Glu Tyr Gly 
485 490 495 

Asp Asp Phe Ala Lys Gly Tyr Asp Ala Ser Lys Ser Glu Tyr Thr Asp 
500 505 510 

Met Leu Ala Thr Gly lie lie Asp Pro Phe Lys Val Val Arg Ser Gly 
515 520 525 

Leu Val Asp Ala Ser Gly Val Ala Ser Leu Leu Ala Thr Thr Glu Val 
530 535 540 

Ala lie Val Asp Ala Pro Glu Pro Pro Ala Ala Ala Gly Ala Gly Gly 
545 550 555 560 

Met Pro Gly Gly Met Pro Gly Met Pro Gly Met Met 
565 570 

(2) INFORMATION FOR SEQ ID NO: 26: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 577 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

Met lie Ser Thr Leu Arg Gly Lys lie Phe Asn Asn Gly Ser Asn Arg 
15 10 15 

Asn Lys Cys Val Ser lie Leu Ser Asn lie Gin Lys Arg Asn lie Ser 
20 25 30 

Lys Asp lie Arg Phe Gly Ser Asp Ala Arg Thr Ala Met Leu Thr Gly 
35 40 45 

Cys Asn Lys Leu Ala Asp Ala Val Ser Val Thr Leu Gly Pro Lys Gly 
50 55 60 

Arg Asn Val lie lie Glu Gin Ser Phe Gly Ser Pro Lys lie Thr Lys 
65 70 75 80 

Asp Gly Val Thr Val Ala Lys Ser lie Glu Phe Asn Asn Lys Leu Ala 
85 90 95 

Asn Leu Gly Ala Gin Met Val Lys Gin Val Ala Ala Asn Thr Asn Gly 
100 105 110 

Lys Ala Gly Asp Gly Thr Thr Thr Ala Thr lie Leu Ala Arg Ser lie 
115 120 125 

Phe Gin Gin Gly Cys Lys Ala Val Asp Ser Gly Met Asn Pro Met Asp 
130 13^"^^ 140 

Leu Leu Arg Gly lie Asn Lys Gly Val Glu Lys Val Leu Glu Tyr Leu 
145 150 155 160 

Asn Ser lie Lys Lys Asp Val Thr Thr Thr Glu Glu lie Phe Asn Val 
165 170 175 

Ala Ser lie Ser Asn Gly Asp Lys Asn lie Gly Gin Leu lie Ala Asp 
180 185 190 

Thr Met Lys Lys Val Gly Lys Glu Gly Thr lie Thr Val Thr Glu Gly 
195 200 205 



Lys Thr Leu Gin His Glu Leu 
210 215 



Glu He Val Glu Gly He Lys Phe Asp 
220 
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Arg Gly Tyr lie 
225 

Leu Asp Lys Pro 



Ser Pro Tyr Phe 
230 

Tyr lie Leu lie 
245 



lie Asn Asn Ser 
235 

His Glu Lys Lys 
250 



Gin Lys Val Glu 
240 

lie Ser Thr Val 
255 



Lys Ser Leu Leu Pro Val Leu Glu His Val Leu Gin Asn Gin Ser Ser 
260 265 270 

Leu Leu Val lie Ala Glu Asp Val Asp Ser Asp Ala Leu Ala Thr Leu 
275 280 285 

lie Val Asn Lys Leu Arg Leu Gly Leu Lys lie Cys Ala Val Lys Ala 
290 295 300 



Pro Gly Phe Gly Glu His Arg Lys Ala Leu lie His Asp lie Ala Val 
305 310 315 320 

Met Thr Gly Ala Lys Val lie Thr Glu Glu Thr Gly Leu Lys Leu Asp 
325 330 335 

Asp Pro Gin Val Val Ser Tyr Leu Gly Lys Ala Lys Ser lie Asn Val 
340 345 350 

Thr Lys Asp Ser Thr Leu lie Met Glu Gly Glu Gly Lys Lys Glu Glu 
355 360 365 

lie Asn Glu Arg Cys Glu Ser lie Arg Asn Ala lie Lys Met Asn Thr 
370 375 380 

Ser Asp Tyr Glu Lys Glu Lys Leu Gin Glu Arg Leu Ala Lys lie Thr 
385 390 395 400 

Gly Gly Val Ala Leu lie Lys Val Gly Gly lie Ser Glu Val Glu Val 
405 410 415 

Asn Glu He Lys Asp Arg He Gin Asp Ala Leu Cys Ala Thr Lys Ala 
420 425 430 

Ala Val Glu Glu Gly He Val Pro Gly Gly Gly Ser Ala Leu Leu Phe 
435 440 445 

Ala Ser Lys Glu Leu Asp Ser Val Gin Thr Asp Asn Tyr Asp Gin Arg 
450 455 460 

Val Gly Val Asn He He Lys Asp Ala Cys Lys Ala Pro He Lys Gin 
465 470 475 480 

He Ala Glu Asn Ala Gly His Glu Gly Ser Val Val Ala Gly Asn He 
485 490 495 

Leu Lys Asp Lys Asn Ser Asn He Gly Phe Asn Ala Gin Glu Gly Lys 
500 505 510 



Tyr Val Asp Met He Glu Ser Gly He He Asp Pro Thr Lys Val Val 
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515 1 520 525 

Lys Thr Ala lie Ser Asp Ala Ala Ser lie Ala Ser Leu Met Thr Thr 
530 535 540 

Thr Glu Val Ala lie Val Asp Phe Lys Asp Ser Lys Asn Glu Glu Ser 
545 550 555 560 

Ser Gin His Met Asn Ser Val Asn Ser Met Gly Asp Met Gly Gly Met 
565 570 575 

Tyr 



(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 550 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 



Met Thr Asn Val 
1 

Glu Val Ala Ala 
20 

Arg Gly Lys Thr 
35 

Thr Lys Asp Gly 
50 

Leu Asn Ala Ala 
65 

Asn Asp Lys Val 



Val Val Ser Gly 
5 

Val lie Asp Ser 



Val Gly lie Asn 
40 

Tyr Lys Val Met 
55 

lie Thr Ser lie 
70 

Gly Asp Gly Thr 
85 



Glu Gin Leu Gin 
10 

Thr Val Ala Val 
25 

Lys Pro Tyr Gly 



Lys Gly lie Lys 
60 

Phe Ala Gin Ser 
75 

Thr Thr Cys Ser 
90 



Gin Ala Phe Arg 
15 

Thr Ala Gly Pro 
30 

Ala Pro Glu lie 
45 

Pro Glu Lys Pro 



Cys Ser Gin Cys 
80 

lie Leu Thr Ser 
95 



Gly Met He Val Glu 
100 

He Ser He Lys Asn 
115 

Glu Val Ala Ser Met 
130 



Ala Ser Lys Ser He Ala 
105 

Gly Met Gin Lys Ala Lys 
120 

Ala Arg Thr He Ser Leu 
135 



Ala Gly Asn Asp Arg 
110 

Asp Val Val Leu Lys 
125 

Glu Lys He Asp Glu 
140 
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Val Ala Gin Val Ala lie lie Ser Ala Asn Gly Asp Arg Ser lie Gly 
145 150 155 160 

Ser Asn lie Ala Asp Ala Val Lys Lys Val Gly Lys Glu Gly Val lie 
165 170 175 

Thr Val Glu Glu Ser Lys Gly Ser Lys Glu Leu Glu Val Glu Leu Thr 
180 185 190 

Thr Gly Met Gin Phe Asp Arg Gly Tyr Leu Ser Pro Tyr Phe lie Thr 
195 200 205 

Asn Asn Glu Lys Met lie Val Glu Leu Asp Asp Pro Tyr Leu Leu lie 
210 215 220 

Thr Glu Lys Lys Leu Asn lie lie Gin Pro Leu Leu Ser lie Leu Glu 
225 230 235 240 

Ala Val Val Lys Ser Gly Lys Pro Leu Leu lie lie Ala Glu Asp lie 
245 250 255 

Glu Gly Glu Ala Leu Ser Thr Leu Val lie Asn Lys Leu Arg Gly Gly 
260 265 270 

Leu Lys Val Ala Ala Val Lys Ala Pro Gly Phe Gly Asp Arg Arg Lys 
275 280 -285 

Glu Met Leu Glu Asp lie Ala Ala Leu Thr Asn Ala Lys Tyr Val He 
290 295 300 

Lys Asp Glu Leu Gly He Lys Met Glu Asp Leu Thr Leu Glu Asp Leu 
305 310 315 320 

Gly He Ala Lys Asn Val Lys He Thr Lys Asp Asn Thr Thr He Val 
325 330 335 

Ser Glu Asn Arg Val Thr Asp Arg Val Lys Ala Arg lie Glu Gin He 
340 345 350 

Lys Ser Gin He Glu Ser Ser Thr Ser Asp Tyr Asp Lys Glu Lys Leu 
355 360 365 

Arg Glu Arg Leu Ala Lys Leu Ser Gly Gly Val Ala Val Leu Lys Val 
370 375 380 

Gly Gly Ala Thr Glu Leu Glu Val Lys Glu Arg Arg Asp Arg Val Glu 
385 390 395 400 

Asp Gin Leu His Ala Thr Arg Ala Ala lie Glu Glu Gly He Val Pro 
405 410 415 

Gly Gly Gly Val Ala Leu Leu Tyr Ala Ser Ser Ala Leu Asp Lys Leu 
420 425 430 



Lys Gly Ala Asp Asp Glu Glu Gin He Gly lie Asn He He Lys Lys 
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435 440 445 

Val Leu Ser Val Pro lie Lys Arg Leu Val Lys Asn Ala Gly Leu Glu 
450 455 460 

Ser Ala Val lie lie Asp Tyr Leu lie Lys Gin Asn Asn Lys Glu Leu 
465 470 475 480 

lie Tyr Asn Val Glu Ala Met Ser Tyr Ala Asn Ala Phe Ala Ala Gly 
485 490 495 

Val lie Asp Pro Ala Lys Val Val Arg lie Ala Phe Glu Thr Ala lie 
500 505 510 

Ser Val Ala Ser Val Leu lie Thr Thr Glu Ser Met lie Val Asp lie 
515 520 525 

Pro Asn Lys Asp Glu Asn Ala Ser Ser Pro Met Gly Ala Gly Gly Met 
530 535 540 

Gly Arg Met Asn Asp Phe 
545 550 

(2) INFORMATION FOR SEQ ID NO:28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 568 amino acids 

(B) TYPE: amino acid 

( C ) STRANDEDNESS : 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:28: 

Met Leu Arg Leu Ala Arg Lys Gly Leu Gin Thr Ala Val Val Arg Ser 
15 10 15 

Tyr Ala Lys Asp Val Lys Phe Gly Ala Glu Gly Arg Gin Ala Met Leu 
20 25 30 

Val Gly Val Asn Leu Leu Ala Asp Ala Val Ser Val Thr Met Gly Pro 
35 40 45 

Lys Gly Arg Asn Val lie lie Glu Gin Ser Trp Gly Ser Pro Lys lie 
50 55 60 

Thr Lys Asp Gly Val Thr Val Ala Lys Ser lie Asp Leu Lys Asp Lys 
65 70 75 80 

Tyr Gin Asn Leu Gly Ala Lys Leu lie Gin Asp Val Ala Asn Lys Ala 
85 90 95 
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Asn Glu Glu Ala Gly Asp Gly Thr Thr Cys Ala Thr Val Leu Thr Arg 
100 105 110 

Ala lie Ala Lys Glu Gly Phe Glu Arg His Ser Ser Arg Gly Asn Ala 
115 120 125 

Val Glu lie Arg Arg Gly Val Met Asn Ala Val Glu Val Val Val Ala 
130 135 140 

Glu Leu Lys Lys lie Ser Lys Lys Val Thr Thr Pro Glu Glu lie Ala 
145 150 155 160 

Gin Val Ala Thr lie Ser Ala Asn Gly Asp Thr Val Val Gly Asn Leu 
165 170 175 

lie Ser Asp Ala Met Lys Lys Val Gly Thr Thr Gly Val lie Thr Val 
180 185 190 

Lys Asp Gly Lys Thr Leu Asn Asp Gin Leu Glu Leu lie Glu Gly Met 
195 200 205 

Lys Phe Asp Arg Gly Tyr lie Ser Pro Tyr Phe lie Thr Ser Ala Lys 
210 215 220 

Gly Ala Lys Val Glu Tyr Glu Lys Ala Leu Val Leu Leu Ser Glu Lys 
225 230 235 240 

Lys lie Ser Gin Val Gin Asp lie Val Pro Ala Leu Glu Leu Ala Asn 
245 250 255 

Lys Leu Arg Arg Pro Leu Val lie lie Ala Glu Asp Val Asp Gly Glu 
260 265 270 

Ala Leu Thr Thr Leu Val Leu Asn Arg Leu Lys Val Gly Leu Gin Val 
275 280 285 

Val Ala lie Lys Ala Pro Gly Phe Gly Asp Asn Arg Lys Asn Ala Leu 
290 295 300 

Lys Asp Met Gly lie Ala Thr Gly Ala Ser lie Phe Gly Asp Glu Thr 
305 310 315 320 

Leu Asp Leu Arg Leu Glu Asp lie Thr Ala Asn Asp Leu Gly Glu Val 
325 330 335 

Asp Glu Val Thr lie Thr Lys Asp Asp Thr Leu Leu Leu Arg Gly Arg 
340 345 350 

Gly Asp Gin Thr Glu lie Glu Lys Arg lie Glu Glu lie Thr Asp Glu 
355 360 365 

lie Glu Arg Ser Thr Ser Asp Tyr Glu Lys Glu Lys Leu Asn Glu Arg 
370 375 380 



Leu Ala Lys Leu Ser Lys Gly Val Ala Val Leu Lys lie Gly Gly Gly 
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385 390 395 400 

Ser Glu Val Glu Val Gly Glu Lys Lys Asp Arg Val Thr Asp Ala Leu 
405 410 415 

Cys Ala Thr Arg Ala Ala Val Glu Glu Gly lie Val Pro Gly Gly Gly 
420 425 430 

Val Ala Leu Leu Arg Ser Leu Thr Ala Leu Lys Asn Tyr Lys Ala Ala 
435 440 445 

Asn Glu Asp Gin Gin lie Gly Val Asn lie Val Lys Lys Ala Leu Thr 
450 455 460 

Gin Pro He Ala Thr He Val Lys Asn Ala Gly Leu Glu Pro Ser Ser 
465 470 475 480 

He He Asp Glu Val Thr Gly Asn Ser Asn Thr Ser Tyr Gly Tyr Asp 
485 490 495 

Ala Leu Asn Gly Lys Phe Val Asp Met Phe Glu Ala Gly He He Asp 
500 505 510 

Pro Thr Lys Val Val Arg Thr Ala Leu Gin Asp Ala Ser Gly Val Ala 
515 520 525 

Ser Leu Leu Ala Thr Thr Glu Cys Val Val Thr Glu He Pro Lys Glu 
530 535 540 

Glu Ala Val Gly Gly Pro Ala Gly Gly Met Gly Gly Met Gly Gly Met 
545 550 555 560 

Gly Gly Met Gly Gly Met Gly Phe 
565 

(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 576 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

<D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:29: 

Met Phe Arg Leu Pro Val Ser Leu Ala Arg Ser Ser He Ser Arg Gin 
15 10 15 

Leu Ala Met Arg Gly Tyr Ala Lys Asp Val Arg Phe Gly Pro Glu Val 
20 25 30 
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Arg Ala Met Met Leu Gin Gly Val Asp Val Leu Ala Asp Ala Val Ala 
35 40 45 

Val Thr Met Gly Pro Lys Gly Arg Asn Val lie He Glu Gin Ser Val 
50 55 60 

Gly Leu Ala Lys He Thr Lys Asp Gly Val Thr Val Ala Lys Ser He 
65 70 75 80 

Glu Leu Lys Asp Lys Phe Gin Asn He Gly Ala Lys Leu Val Gin Asp 
85 90 95 

Leu Ala Asn Asn Thr Asn Glu Glu Ala Gly Asp Gly Thr Thr Thr Ala 
100 105 110 

Thr Phe Leu Ala Arg Ala He Ala Lys Glu Gly Phe Glu Lys He Ser 
115 120 125 

Lys Gly Gly Asn Pro Val Glu He Arg Arg Gly Val Met Leu Ala Val 
130 135 140 

Glu Thr Val Lys Asp Asn Leu Lys Thr Met Ser Arg Pro Val Ser Thr 
145 150 155 160 

Pro Glu Glu He Ala Gin Val Ala Thr He Ser Ala Asn Gly Asp Arg 
165 170 175 

Glu lie Gly Asn Gly Lys Val Ser Val Ser Glu Ala Met Lys Lys Val 
180 185 190 

Gly Arg Asp Gly Val He Thr Val Lys Asp Gly Lys Thr Leu Thr Asp 
195 200 205 

Glu Leu Glu Val He Glu Gly Thr Met Arg Phe Asp Arg Gly Tyr He 
210 215 220 

Ser Pro Tyr Phe He Asn Ser Ser Lys Gly Ala Lys Val Glu Phe Gin 
225 230 235 240 

Asp Ala Leu Leu Leu Leu Ser Glu Lys Lys He Ser Ser Val Ala Glu 
245 250 255 

His His Ser Pro Leu Trp Arg Leu Ala Ser Arg Arg Thr Arg Lys Pro 
260 265 270 

Leu Val He He Ala Glu Asp He Asp Gly Glu Ala Leu Ser Thr Leu 
275 280 285 

Val Val Asn Arg Leu Lys He Gly Leu Gin Val Ala Ala Val Lys Ala 
290 295 300 

Pro Gly Phe Gly Asp Asn Arg Lys Ser Thr Leu Thr Asp Met Ala Thr 
305 310 315 320 



Ser Gly Gly He Val Phe Gly Asp Asp Val Ser Leu Val Lys Leu Glu 
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325 330 335 

Asp Val Lys Val Ser Asp Leu Gly Gin Val Gly Glu Val Val lie Thr 
340 345 350 

Lys Asp Asp Thr Leu Leu Leu Lys Gly Lys Gly Lys Lys Asp Asp Val 
355 360 365 

Leu Arg Arg Ala Asn Gin lie Arg Thr Lys lie Glu Asp Thr Thr Ser 
370 375 380 

Glu Tyr Glu Lys Glu Lys Leu Gin Glu Arg Leu Ala Arg Leu Ala Ser 
385 390 395 400 

Gly Val Ala Leu Arg Val Gly Gly Ser Ser Glu Val Glu Val Asn Glu 
405 410 415 

Lys Lys Asp Arg Val His Asp Ala Leu Asn Ala Thr Arg Ala Ala Val 
420 425 430 

Glu Glu Gly He Val Pro Gly Gly Gly Arg Pro Leu Leu Arg Cys He 
435 440 445 

Glu Lys Leu Glu Gly Val Glu Thr Thr Asn Glu Asp Gin Lys Leu Gly 
450 455 460 

Val Glu He Val Arg Arg Ala Leu Arg Met Pro Cys Met Thr He Ala 
465 470 475 480 

Lys Asn Ala Gly Val Asp Gly Ala Met Val Val Ala Lys Val Glu Asn 
485 490 495 

Gin Ala Gly Asp Tyr Gly Tyr Asp Ala Lys Gly Glu Tyr Gly Asn Leu 
500 505 510 

He Glu Lys Gly He He Asp Pro Thr Lys Val Val Arg Thr Ala He 
515 520 525 

Thr Asp Ala Ser Gly Val Ala Ser Leu Leu Thr Thr Ala Glu Ala Val 
530 535 ~ 540 

Val Thr Glu He Pro Lys Glu Asp Gly Ala Pro Ala Met Pro Gly Met 
545 550 555 560 

Gly Gly Met Gly Gly Met Gly Gly Met Gly Gly Met Gly Gly Met Met 
565 570 575 



(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 573 amino acids 

(B) TYPE: amino acid 

( C) STRAND EDNESS : 

(D) TOPOLOGY: linear 
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<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 

Met Leu Arg Leu Pro Thr Val Phe Arg Gin Met Arg Pro Val Ser Arg 
1 5 10 15 

Val Leu Ala Pro His Leu Thr Arg Ala Tyr Ala Lys Asp Val Lys Phe 
20 25 30 

Gly Ala Asp Ala Arg Ala Leu Met Leu Gin Gly Val Asp Leu Leu Ala 
35 40 45 

Asp Ala Val Ala Val Thr Met Gly Pro Lys Gly Arg Thr Val lie lie 
50 55 60 

Glu Gin Gly Trp Gly Ser Pro Lys Val Thr Lys Asp Gly Val Thr Val 
65 70 75 80 

Ala Lys Ser lie Asp Leu Lys Asp Lys Tyr Lys Asn lie Gly Ala Lys 
85 90 95 

Leu Val Gin Asp Val Ala Asn Asn Thr Asn Glu Glu Ala Gly Asp Gly 
100 105 HO 



Thr Thr Thr Ala Thr Val Leu Ala Arg Ser lie Ala Lys Glu Gly Phe 
115 120 125 

Glu Lys lie Ser Lys Gly Ala Asn Pro Val Glu He Arg Arg Gly Val 
130 135 140 

Met Leu Ala Val Asp Ala Val He Ala Glu Leu Lys Lys Gin Ser Lys 
145 150 155 160 

Pro Val Thr Thr Pro Glu Glu He Ala Gin Val Ala Thr He Ser Ala 
165 170 175 

Asn Gly Asp Lys Glu He Gly Asn He He Ser Asp Ala Met Lys Lys 
180 185 190 

Val Gly Arg Lys Gly Val He Thr Val Lys Asp Gly Lys Thr Leu Asn 
195 200 205 

Asp Glu Leu Glu He He Glu Gly Met Lys Phe Asp Arg Gly Tyr He 
210 215 220 

Ser Pro Tyr Phe He Asn Thr Ser Lys Gly Gin Lys Cys Glu Phe Gin 
225 230 235 240 



Asp Ala Tyr Val Leu Leu Ser Glu Lys Lys He Ser Ser He Gin Ser 
245 250 255 
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He Val Pro Ala Leu Glu He Ala Asn Ala His Arg Lys Pro Leu Val 
260 265 270 

He He Ala Glu Asp Val Asp Gly Glu Ala Leu Ser Thr Leu Val Leu 
275 280 285 

Asn Arg Leu Lys Val Gly Leu Gin Val Val Ala Val Lys Ala Pro Gly 
290 295 300 

Phe Gly Asp Asn Arg Lys Asn Gin Leu Lys Asp Met Ala He Ala Thr 
305 310 315 320 

Gly Gly Ala Val Phe Gly Glu Glu Gly Leu Thr Leu Asn Leu Glu Asp 
325 330 335 

Val Gin Pro His Asp Leu Gly Lys Val Gly Glu Val lie Val Thr Lys 
340 345 350 



\ 

\ 



Asp Asp Ala Met 
355 

Lys Arg He Gin 
370 



Leu Leu Lys Gly 
360 

Glu He He Glu 
375 



Lys Gly Asp Lys 

Gin Leu Asp Val 
380 



Ala Gin He Glu 
365 

Thr Thr Ser Glu 



Tyr Glu Lys Glu 
385 

Val Ala Val Leu 



Lys Lys Asp Arg 
420 

Glu Glu Gly He 
435 

Pro Ala Leu Asp 
450 

He Glu He He 
465 

Lys Asn Ala Gly 



Ser Ser Ser Glu 
500 

Met Val Glu Lys 
515 



Lys Leu Asn Glu 
390 

Lys Val Gly Gly 
405 

Val Thr Asp Ala 



Val Leu Gly Gly 
440 

Ser Leu Thr Pro 
455 

Lys Arg Thr Leu 
470 

Val Glu Gly Ser 
485 

Val Gly Tyr Asp 



Gly He He Asp 
520 



Arg Leu Ala Lys 
395 

Thr Ser Asp Val 
410 

Leu Asn Ala Thr 
425 

Gly Cys Ala Leu 



Ala Asn Glu Asp 
460 

Lys He Pro Ala 
475 

Leu He Val Glu 
490 

Ala Met Ala Gly 
505 

Pro Thr Lys Val 



Leu Ser Asp Gly 
400 

Glu Val Asn Glu 
415 

Arg Ala Ala Val 
430 

Leu Arg Cys He 
445 

Gin Lys He Gly 



Met Thr He Ala 
480 

Lys He Met Gin 
495 

Asp Phe Val Asn 
510 

Val Arg Thr Ala 
525 



Leu Leu Asp Ala Ala Gly Val Ala Ser Leu Leu Thr Thr Ala Glu Val 
530 535 540 

Val Val Thr Glu He Pro Lys Glu Glu Lys Asp Pro Gly Met Gly Ala 
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545 550 . 555 560 

Met Gly Gly Met Gly Gly Gly Met Gly Gly Gly Met Phe 
565 570 

(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 577 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 

Met Tyr Arg Phe Ala Ser Asn Leu Ala Ser Lys Ala Arg lie Ala Gin 
15 10 15 

Asn Ala Arg Gin Val Ser Ser Arg Met Ser Trp Ser Arg Asn Tyr Ala 
20 - 25 30 

Ala Lys Glu lie Lys Phe Gly Val Glu Ala Arg Ala Leu Met Leu Lys 
35 40 45 

Gly Val Glu Asp Leu Ala Asp Ala Val Lys Val Thr Met Gly Pro Lys 
50 55 60 

Gly Arg Asn Val. Val lie Glu Gin Ser Trp Gly Ala Pro Lys Val Thr 
65 70 75 80 

Lys Asp Gly Val Thr Val Ala Lys Ser lie Glu Phe Lys Asp Lys lie 
85 90 95 

Lys Asn Val Gly Ala Ser Leu Val Lys Gin Val Ala Asn Ala Thr Asn 
100 105 110 

Asp Val Ala Gly Asp Gly Thr Thr Cys Ala Thr Val Leu Thr Arg Ala 
115 120 125 

lie Phe Ala Glu Gly Cys Lys Ser Val Ala Ala Gly Met Asn Ala Met 
130 135 140 

Asp Leu Arg Arg Gly lie Ser Met Ala Val Asp Ala Val Val Thr Asn 
145 150 155 160 

Leu Lys Ser Lys Ala Arg Met lie Ser Thr Ser Glu Glu lie Ala Gin 
165 170 175 



Val Gly Thr lie Ser Ala Asn Gly Glu Arg Glu lie Gly Glu Leu lie 
180 185 190 



*5* >J,- ■ TJ *| 
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Ala Lys Ala Met Glu Lys Val Gly Lys Glu Gly Val lie Thr lie Gin 
195 200 205 

Asp Gly Lys Thr Leu Phe Asn Glu Leu Glu Val Val Glu Gly Met Lys 
210 215 220 

Leu Asp Arg Gly Tyr Thr Ser Pro Tyr Phe lie Thr Asn Gin Lys Thr 
225 230 235 240 

Gin Lys Cys Glu Leu Asp Asp Pro Leu lie Leu lie His Glu Lys Lys 
245 250 255 

lie Ser Ser lie Asn Ser lie Val Lys Val Leu Glu Leu Ala Leu Lys 
260 265 270 

Arg Gin Arg Pro Leu Leu lie Val Ser Glu Asp Val Glu Ser Asp Ala 
275 280 285 

Leu Ala Thr Leu lie Leu Asn Lys Leu Arg Ala Gly lie Lys Val Cys 
290 295 300 

Ala lie Lys Ala Pro Gly Phe Gly Glu Asn Arg Lys Ala Asn Leu Gin 
305 310 315 320 

Asp Leu Ala Ala Leu Thr Gly Gly Glu Val lie Thr Asp Glu Leu Gly 
325 330 335 

Met Asn Leu Glu Lys Val Asp Leu Ser Met Leu Gly Thr Cys Lys Lys 
340 345 350 

Val Thr Val Ser Lys Asp Asp Thr Val lie Leu Asp Gly Ala Gly Asp 
355 360 365 

Lys Lys Gly lie Glu Glu Arg Cys Glu Gin lie Arg Ser Ala lie Glu 
370 375 380 

Leu Ser Thr Ser Asp Tyr Asp Lys Glu Lys Leu Gin Glu Arg Leu Ala 
385 390 395 400 

Lys Leu Ser Gly Gly Val Ala Val Leu Lys lie Gly Gly Ala Ser Glu 
405 410 415 

Ala Glu Val Gly Glu Lys Lys Asp Arg Val Thr Asp Ala Leu Asn Ala 
420 425 430 

Thr Lys Ala Ala Val Glu Glu Gly lie Leu Pro Gly Gly Gly Val Ala 
435 440 445 

Leu Leu Tyr Ala Ala Arg Glu Leu Glu Lys Leu Pro Thr Ala Asn Phe 
450 455 460 

Asp Gin Lys lie Gly Val Gin lie lie Gin Asn Ala Leu Lys Thr Pro 
465 470 475 480 



Val Tyr Thr lie Ala Ser Asn Ala Gly Val Glu Gly Ala Val lie Val 
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485 490 495 

Gly Lys Leu Leu Glu Gin Asp Asn Pro Asp Leu Gly Tyr Asp Ala Ala 
500 505 510 

Lys Gly Glu Tyr Val Asp Met Val Lys Ala Gly lie lie Asp Pro . Leu 
515 520 525 

Lys Val lie Arg Thr Ala Leu Val Asp Ala Ala Ser Val Ser Ser Leu 
530 535 540 

Leu Thr Thr Thr Glu Ala Val Val Val Asp Leu Pro Lys Asp Glu Ser 
545 550 555 560 

Glu Ser Gly Ala Ala Gly Gly Gly Met Gly Gly Met Val Val Met Asp 
565 570 575 

Tyr 



(2) INFORMATION FOR SEQ ID NO: 32: . 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 576 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 

Met Tyr Arg Ala Ala Ala Ser Leu Ala Ser Lys Ala Arg Gin Ala Gly 
1 5 10 15 

Ser Ser Ser Ala Ala Arg Gin Val Gly Ser Arg Leu Ala Trp Ser Arg 
20 25 30 

Asn Tyr Ala Ala Lys Asp He Lys Phe Gly Val Glu Ala Arg Ala Leu 
35 40 45 

Met Leu Arg Gly Val Glu Glu Leu Ala Asp Ala Val Lys Val Thr Met 
50 55 60 

Gly Pro Lys Gly Arg Asn Val Val He Glu Gin Ser Phe Gly Ala Pro 
65 70 75 80 

Lys Val Thr Lys Asp Gly Val Thr Val Ala Lys Ser He Glu Phe Lys 
85 90 95 



Asp Arg Val Lys Asn Val Gly Ala Ser Leu Val Lys Gin Val Ala Asn 
100 105 HO 
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Ala Thr Asn Asp Asn Ala Gly Asp Gly Thr Thr Cys Ala Thr Val Leu 
115 120 125 

Thr Lys Ala lie Phe Thr Glu Gly Cys Lys Ser Val Ala Ala Gly Met 
130 135 140 

Asn Ala Met Asp Leu Arg Arg Gly lie Ser Met Ala Val Asp Ala Val 
145 150 155 160 

Val Thr Asn Leu Lys Gly Met Ala Arg Met lie Ser Thr Ser Glu Glu 
165 170 175 

lie Ala Gin Val Gly Thr lie Ser Ala Asn Gly Glu Arg Glu lie Gly 
180 185 190 

Glu Leu lie Ala Lys Ala Met Glu Lys Val Gly Lys Glu Gly Val lie 
195 200 205 

Thr lie Ala Asp Gly Asn Thr Leu Tyr Asn Glu Leu Glu Val Val Glu 
210 215 220 

Gly Met Lys Leu Asp Arg Gly Tyr lie Ser Pro Tyr Phe lie Thr Asn 
225 23.0 235 240 

Ser Lys Ala Gin Lys Cys Glu Pro Glu Asp Pro Leu lie Leu lie His 
245 250 255 

Asp Arg Lys Val Thr Asn Met His Ala Val Val Lys Val Leu Glu Met 
260 265 270 

Ala Leu Lys Lys Gin Arg Pro Leu Leu lie Val Ala Glu Asp Val Glu 
275 280 285 

Ser Glu Ala Leu Gly Thr Leu lie lie Asn Lys Leu Arg Ala Gly lie 
290 295 300 

Lys Val Cys Ala Val Lys Ala Pro Gly Phe Gly Glu Asn Arg Lys Ala 
305 310 315 320 

Asn Leu Gin Asp Leu Ala lie Leu Thr Gly Gly Glu Val lie Thr Glu 
325 330 335 

Glu Leu Gly Met Asn Leu Glu Asn Val Glu Pro His Met Leu Gly Ser 
340 345 350 

Cys Lys Lys Val Thr Val Ser Lys Asp Asp Thr Val lie Leu Asp Gly 
355 360 365 

Ala Gly Asp Lys Lys Ser lie Glu Glu Arg Ala Asp Gin lie Arg Ser 
370 375 380 

Ala Val Glu Asn Ser Thr Ser Asp Tyr Asp Lys Glu Lys Leu Gin Glu 
385 390 395 400 

Arg Leu Ala Lys Leu Ser Gly Gly Val Ala Val Leu Lys lie Gly Gly 
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405 410 415 

Ala Ser Glu Ala Glu Val Gly Glu Lys Lys Asp Arg Val Thr Asp Ala 
420 425 430 

Leu Asn Ala Thr Lys Ala Ala Val Glu Glu Gly lie Val Pro Gly Gly 
435 440 445 

Gly Val Ala Leu Leu Tyr Ala Ser Lys Glu Leu Asp Lys Leu Gin Thr 
450 455 460 

Ala Asn Phe Asp Gin Lys lie Gly Val Gin lie lie Gin Asn Ala Leu 
465 470 475 480 

Lys Thr Pro Val His Thr lie Ala Ser Asn Ala Gly Val Glu Gly Ala 
485 490 495 

Val Val Val Gly Lys Leu Leu Glu Gin Gly Asn Thr Asp Leu Gly Tyr 
500 505 510 

Asp Ala Ala Lys Asp Glu Tyr Val Asp Met Val Lys Ala Gly lie lie 
515 520 525 

Asp Pro Leu Lys Val lie Arg Thr Ala Leu Val Asp Ala Ala Ser Val 
530 535 540 

Ser Ser Leu Met Thr Thr Thr Glu Ser lie lie Val Glu lie Pro Lys 
545 550 555 560 

Glu Glu Ala Pro Ala Pro Ala Met Gly Gly Met Gly Gly Met Asp Tyr 
565 570 575 



(2) INFORMATION FOR SEQ ID NO:33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 587 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 

Met Ala Ser Thr Asn Ala Leu Ser Ser Thr Ser lie Leu Arg Ser Pro 
15 10 15 

Thr Asn Gin Ala Gin Thr Ser Leu Ser Lys Lys Val Lys Gin His Gly 
20 25 30 

Arg Val Asn Phe Arg Gin Lys Pro Asn Arg Phe Val Val Lys Ala Ala 

35 40 45 
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Ala Lys Asp lie Ala 
50 

Gly He Asp Lys Leu 
65 

Gly Arg Asn Val Val 
85 

Asp Gly Val Thr He 
100 

Asn Ala Gly Ala Ala 
115 

Ser Ala Gly Asp Gly 
130 

He Lys Leu Gly Leu 
145 

He Lys Lys Gly He 
165 



Phe Asp Gin His Ser Arg 
55 

Ala Asp Ala Val Gly Leu 
70 75 

Leu Asp Glu Phe Gly Ser 
90 

Ala Arg Ala He Glu Leu 
105 

Leu He Arg Glu Val Ala 
120 

Thr Thr Thr Ala Ser He 
135 

Leu Asn Val Thr Ser Gly 
150 155 

Asp Lys Thr Val Ala Ala 
170 



Ser Ala Met Gin Ala 
60 

Thr Leu Gly Pro Arg 
80 

Pro Lys Val Val Asn 
95 

Pro Asp Pro Met Glu 
110 

Ser Lys Thr Asn Asp 
125 

Leu Ala Arg Glu He 
140 

Ala Asn Pro Val Ser 
160 

Leu Val Glu Glu Leu 
175 



Glu Lys Leu Ala 
180 

Ala Thr He Ser 
195 

Glu Ala He Asp 
210 

Ser Asn Ser Phe 
225 

Asp Arg Gly Tyr 



He Val Glu Phe 
260 

Ser Ala He Lys 
275 



Arg Pro Val Lys 



Ala Gly Asn Asp 
200 

Lys Val Gly Pro 
215 

Glu Thr Thr Val 
230 

He Ser Pro Gin 
245 

Glu Asn Ala Arg 



Asp He He Pro 
280 



Gly Gly Asp Asp 
185 

Glu Leu He Gly 



Asp Gly Val Leu 
220 

Glu Val Glu Glu 
235 

Phe Val Thr Asn 
250 

Val Leu He Thr 
265 

Leu Leu Glu Lys 



He Lys Ala Val 
190 

Lys Met He Ala 
205 

Ser He Glu Ser 



Gly Met Glu He 
240 

Pro Glu Lys Ser 
255 

Asp Gin Lys He 
270 

Thr Thr Gin Leu 
285 



Arg Ala Pro Leu Leu He He Ser Glu Asp He Thr Gly Glu Ala Leu 

290 295 300 

Ala Thr Leu Val Val Asn Lys Leu Arg Gly He Leu Asn Val Ala Ala 
305 310 315 320 

He Lys Ala Pro Gly Phe Gly Glu Arg Arg Lys Ala Leu Leu Gin Asp 
325 330 335 
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lie Ala lie Leu Thr Gly Ala Glu Phe Gin Ala Ser Asp Leu Gly Leu 
340 345 350 

Leu Val Glu Asn Thr Thr He Glu Gin Leu Gly Leu Ala Arg Lys Val 
355 360 365 

Thr He Ser Lys Asp Ser Thr Thr He He Ala Asp Ala Ala Ser Lys 
370 375 380 

Asp Glu Leu Gin Ser Arg Val Ala Gin Leu Lys Lys Glu Leu Ser Glu 
385 390 395 400 

Thr Asp Ser He Tyr Asp Ser Glu Lys Leu Ala Glu Arg He Ala Lys 
405 410 415 

Leu Ser Gly Gly Val Ala Val He Lys Val Gly Ala Ala Thr Glu Thr 
420 425 430 

Glu Leu Glu Asp Arg Lys Leu Arg He Glu Asp Ala Lys Asn Ala Thr 
435 440 445 

Phe Ala Ala He Glu Glu Gly He Val Pro Gly Gly Gly Thr Ala Leu 
450 455 460 

Val His Leu Ser Gly Tyr Val Pro Ala He Lys Glu Lys Leu Glu Asp 
465 470 475 480 

Ala Asp Glu Arg Leu Gly Ala Asp He Val Gin Lys Ala Leu Val Ala 
485 490 495 

Pro Ala Ala Leu He Ala Gin Asn Ala Gly He Glu Gly Glu Val Val 
500 505 510 



Val Glu Lys He Lys Asn Gly Glu Trp GfiilVa.l Gly Tyr Asn Ala Met 
515 520 525 

Thr Asp Thr Tyr Glu Asn Leu Val Glu Ser Gly Val He Asp Pro Ala 
530 535 540 

Lys Val Thr Arg Cys Ala Leu Gin Asn Ala Ala Ser Val Ala Gly Met 
545 550 555 560 

Val Leu Thr Thr Gin Ala He Val Val Glu Lys Pro Lys Pro Lys Ala 
565 570 575 

Ala Val Ala Ala Ala Pro Gin Gly Leu Thr He 
580 585 

(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 54 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:34: 

Met Ala Lys Asp lie Lys Phe Gly Glu Glu Ala Arg Arg Ala Met Leu 
15 10 15 

Arg Gly Val Asn Ala Leu Ala Asp Ala Val Lys Val Thr Leu Gly Pro 
20 25 30 

Lys Gly Arg Asn Val Val Leu Glu Lys Ser Phe Gly Ala Pro Thr lie 
35 40 45 

Thr Lys Asp Gly Val Thr Val Ala Lys Glu lie Glu Leu Glu Asp Lys 
50 55 60 

Phe Glu Asn Met Gly Ala Gin Leu Val Lys Glu Val Ala Ser Lys Thr 
65 70 75 80 

Asn Asp Val Ala Gly Asp Gly Thr Thr Thr Ala Thr Val Leu Ala Gin 
85 90 95 

Ala lie Val Lys Glu Gly Leu Lys Asn Val Ala Ala Gly Ala Asn Pro 
100 105 110 

Met Asp Leu Arg Arg Gly lie Asp Lys Ala Val Asp Ala Val Val Glu 
115 120 125 

Glu Leu Lys Ala lie Ala Lys Pro Val Glu Thr Lys Glu Glu lie Ala 
130 135 140 

Gin Val Ala Thr lie Ser Ala Asn Gly Asp Glu Glu lie Gly Glu Leu 
145 150 155 160 

lie Ala Glu Ala Met Glu Lys Val Gly Lys Glu Gly Val lie Thr Val 
165 170 175 

Glu Glu Gly Lys Thr Leu Glu Thr Glu Leu Glu Val Val Glu Gly Met 
180 185 190 

Gin Phe Asp Arg Gly Tyr lie Ser Pro Tyr Phe lie Thr Asp Ser Glu 
195 200 205 

Lys Gin Lys Ala Glu Leu Glu Asp Pro Leu lie Leu Leu Thr Asp Lys 
210 215 220 

Lys lie Ser Asn He Gin Asp Leu Leu Pro Val Leu Glu Glu Val Ala 
225 230 235 240 

Gin Ala Gly Lys Pro Leu Leu He He Ala Glu Asp Val Glu Gly Glu 
245 250 255 
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Ala Leu Ala Thr Leu Val Val Asn Lys Leu Arg Gly Thr Leu Lys Val 
260 265 270 

Val Ala Val Lys Ala Pro Gly Phe Gly Asp Arg Arg Lys Ala Met Leu 
275 280 285 

Gin Asp lie Ala lie Leu Thr Gly Gly Gin Val lie Ser Glu Glu Leu 
290 295 300 

Gly Leu Ser Leu Glu Asp Ala Thr Leu Glu Asp Leu Gly Gin Ala Lys 
305 310 315 320 

Lys Val Val Val Thr Lys Asp Asp Thr Thr lie Val Asp Gly Ala Gly 
325 330 335 

Asp Ala Ala lie Ala Gly Arg Val Ala Gin He Arg Ser Gin He Glu 
340 345 350 

Glu Ser Thr Ser Asp Tyr Asp Lys Glu Lys Leu Gin Glu Arg Leu Ala 
355 360 365 

Lys Leu Ala Gly Gly Val Ala Val He Lys Val Gly Ala Ala Thr Glu 
370 375 380 

Val Glu Leu Lys Glu Arg Lys Asp Arg Val Glu Asp Ala Leu Asn Ala 
385 390 395 400 

Thr Arg Ala Ala Val Glu Glu Gly He Val Pro Gly Gly Gly Val Ala 
405 410 415 

Leu Leu Arg Ala Ala Pro Ala Leu Asp Lys Leu Lys Thr Glu Asn Gly 
420 425 430 

Asp Glu Ala Thr Gly Val Asn He Val Leu Arg Ala Leu Glu Ala Pro 
435 440 445 

Leu Arg Gin He Ala Glu Asn Ala Gly Leu Glu Gly Ser Val Val Val 
450 455 460 

Glu Lys Val Lys Asn Ser Glu Ala Gly Gly Tyr Asn Ala Ala Thr Gly 
465 470 475 480 

Glu Tyr Val Asp Met He Ala Ala Gly He He Asp Pro Thr Lys Val 
485 490 495 

Thr Arg Ser Ala Leu Gin Asn Ala Ala Ser Val Ala Ser Leu Met Leu 
500 505 510 

Thr Thr Glu Ala Val Val Val Asp Lys Pro Glu Lys Glu Ala Ala Pro 
515 520 525 

Ala Gly Met Pro Gly Met Met Gly Gly Met Gly Gly Met Gly Gly Met 
530 535 540 



Met 
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545 

(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 
CATATGGCNG CNAAAGAYGT AAAA 
(2) INFORMATION FOR SEQ ID NO:36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO 
TGATCACATC ATNCCNCCCA TNCC 
(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO 
CATATGGCAA AAGAAATHAA RTTY 
(2) INFORMATION FOR SEQ ID NO: 38: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:38: 
TGATCANCCN CCCATNCCNC CCAT 24 



(2) INFORMATION FOR SEQ ID NO: 39: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 
GTAAAACGAC GGCCAG 16 
(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:40: 
CAGGAAACAG CTATGAC 17 
(2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:41: 
CCAACCATCA CGAAAGA 17 
(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:42: 
ACGGGTCACT TTGGTTG 17 
(2) INFORMATION FOR SEQ ID NO: 43: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 
TTACTAATGA CGGGGTA 17 
(2) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44: 
TTACCAATGA CGGTGTG 17 
(2) INFORMATION FOR SEQ ID NO: 45: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
<D> TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:45j 
ACAGGGTCAA TGATTCC 
(2) INFORMATION FOR SEQ ID NO: 46: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:46: 
ACTGGATCAA TGATACC 
(2) INFORMATION FOR SEQ ID NO: 47: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47: 
CCGTACCGTG CTCTGAC 17 
(2) INFORMATION FOR SEQ ID NO:48: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48: 
ACCACGTTTC AGATCCA 17 
(2) INFORMATION FOR SEQ ID NO: 49: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49: 
GACAGTTTCG CGGCAAC 17 
(2) INFORMATION FOR SEQ ID NO: 50: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : S ingle 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 
CTCAGAACGA AGATCAG 17 
(2) INFORMATION FOR SEQ ID NO: 51: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:51: 
GGTATGCAGT TCGACCG 17 y 



y 



y 

y 
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(2) INFORMATION FOR SEQ ID NO : 52 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:52: 
CCGTGTTGGT CAAATCC 17 
(2) INFORMATION FOR SEQ ID NO: 53: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53: 
GGTAACTACG GTTACAA 17 
(2) INFORMATION FOR SEQ ID NO: 54: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:54: 
GAGGCCACTT CTTTCAC 
(2) INFORMATION FOR SEQ ID NO: 55: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
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<D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55: 
GGCTTCCAGC ACTGGCA 17 
(2) INFORMATION FOR SEQ ID NO: 56: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56: 
AACTTCAGTC GCAGCAC 17 
(2) INFORMATION FOR SEQ ID NO: 57: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57: 
CCTTGAAAGC CATTGCT 17 
(2) INFORMATION FOR SEQ ID NO: 58: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:58: 
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GCTACACGTG CAGCCGT 
(2) INFORMATION FOR SEQ ID NO: 59: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59 
GCTGCAACAG GTGAGTG 
(2) INFORMATION FOR SEQ ID NO: 60: 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60 
TCATGAACAA TGGCTTG 
(2) INFORMATION FOR SEQ ID NO: 61: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61 
ACGAAGCACA ATGTTAC 
(2) INFORMATION FOR SEQ ID NO:62: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 17 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 62: 
ATCACTAAAG ATGGTGT I 7 
(2) INFORMATION FOR SEQ ID NO:63: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63: 

GCAGTTGCCG CAGCAGT 17 

(2) INFORMATION FOR SEQ ID NO: 64: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 17 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64: 
GCTACTCGTG CAGCTGT 17 
(2) INFORMATION FOR SEQ ID NO: 65: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



